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AN ENZYME EXHIBITING CELLULASE ACTIVITY 
FIELD OF THE INVENTION 

5 The present invention relates to an enzyme exhibiting cellulase 
activity, a DNA construct encoding the enzyme, a cellulolytic 
agent comprising the enzyme and a detergent composition 
containing the enzyme. 

10 BACKGROUND OF THE INVENTION 

Biomass which largely consists of cellulose, hemicellulose and 
lignin has attracted increasing attention as an important 
renewable source of energy (including nutritional energy) . The 

15 amount of carbon fixed by photosynthesis has been estimated to 
be lOOxlO 9 tons per year worldwide, and half of that is contained 
in cellulose. If this material, or at least a significant part 
of it, could be converted into liquid fuel, gas and feed 
protein, this would constitute a significant contribution to 

20- solving the problem of recycling and conservation of resources. 
However, it has been found difficult to develop an economically 
viable process of converting cellulosic material into 
fermentable sugars. 

25 The currently most promising of the suggested processes involves 
the use of enzymes which are able to degrade cellulose. These 
enzymes which are collectively known as cellulases are produced 
by a number of microorganisms, including fungi (e.g. Trichoderma 
reseei, Humicola insolens, Fusarium coQrsporum, etc.) and 
30 bacteria (e.g. Clostridium thermocellum , Cellulomonas spp., 
Thermonospora spp., Bacterioides spp., Microbispora bispora , 
etc.). The economics of the production of fermentable sugars 
from biomass by means of such enzymes is not yet competitive 
with, for instance, the production of glucose from starch by 
means of a-amylase due to the ineffeciency of the cellulase 
enzymes. The most significant problems connected with the use of 
cellulases is their low specific activity and the high cost of 
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their production. Therefore, there is a need to develop! 
cellulases which are more efficient in degrading cellulos ic ' 
materials into fermentable sugars. 

Apart from their utility for the degradation of biomass, 
cellulases have also been suggested for use in detergent 
compositions for the treatment of cotton-containing fabrics 
which largely consist of cellulose. It is well known that i' 
repeated washing of cotton-containing fabrics generally causes f 
a pronounced, unpleasant harshness in the fabric due to the I 
presence of amorphous regions in the cellulose fibres, which! 
regions form protruding parts on the otherwise smooth fibres. \ 
Several methods for overcoming this problem have previously been f 
suggested. For example, US 1.368.599 of Unilever Ltd. teaches f 
the use of cellulases for reducing the harshness of cotton- I 
containing fabrics. Also, US 4.435.307 (of Novo Industri A/S) £ 
teaches the use of a cellulytic enzyme derived from HumicoT „ I 
insolens as well as a fraction thereof as a harshness reducing I 
detergent additive, other uses of cellulases mentioned in the I 
art include* soil removal from and colour clarification of fabric f 
. (cf. for instance EP 220 016). I 



Although the use of cellulase enzymes for harshness reduction of 
cotton-containing fabrics was suggested and demonstrated nearly 

5 20 years ago the mechanism of this process has not been 
elucidated and is still not known in detail. Among other things, 
this is due to the multiplicity of the enzymes and the enzyme- 
catalyzed reactions involved. As a matter of fact, cellulases 
generated in nature e.g. by microbial species are indeed complex 

) mxxtures of cellulases. Accordingly, the conversion of natural- 
ly occurring materials, like cotton, catalyzed by cellulases is 
exceedingly difficult to analyze in detail. 

Due to these circumstances, the practical exploitation of 
cellulases for harshness reduction and prevention as well as 
colour clarification, however desirable, has not become 
widespread and of great practical utility: it is difficult to 
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optimise production of multiple enzyme systems and thus to 
implement industrial cost-effective production of cellulase en- 
zymes, and their actual use has been hampered by difficulties 
. : ; arising from the need to employ rather large quantities of the 
| 5 cellulases to achieve the desilred reduction and prevention of 

jent f the harshness of cotton fabrics: for instance, addition of large 

rics ! quantities of the enzymes to detergent compositions is not com- ■ 

patible with the optimal function of other ingredients in the 
detergent formulation nor is the addition of very large 
10 quantities of enzymes to the detergent composition in the 
interests of, e.g., consumer safety. 



The object of the present invention is therefore to provide 
cellulase enzymes with a high specific activity. 
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S) I SUMMARY OF THE INVENTION 

la I 

ig i The present invention relates to an enzyme which exhibits 

ie I cellulase activity, which enzyme is producible by a strain of 

c 20 Bacillus spp., NCIMB 40250, or by a related Bacillus spp. 

strain, or a derivative of said cellulase. The strain NCIMB 
40250 was deposited on 18 January, 1990, in the National 
p Collection of Industrial and Marine Bacteria, 23 St Machar 

r Drive, Aberdeen, Scotland, UK, in accordance with the provisions 

25 of the Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purpose of Patent Procedure. 

In the present context, the expression "enzyme exhibiting 
cellulase activity" is meant to be understood as an enzyme which 
3 0 is involved in the process of cellulose degradation. There are 
three different types of cellulases which act synergistically to 
produce soluble sugars: endoglucanases which show affinity for 
cellulose and which attack amorphous regions of low 
cry stall inity in the cellulose fibre resulting in the formation 
of free ends; exoglucanases which initiate degradation from the 
non-reducing chain ends by removing cellobiose units; and B- 
glucosidases which hydrolyse cellobiose to glucose. 
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The expression "related Bacillus spp. strain" is intended to 
indicate a strain belonging to the same Bacillus species as the 
strain NCIMB 40250 or a strain of a closely related species. The 
5 species to which the strain NCIMB 4 0250 belongs has been 
identified as Bacillus lauttis. The scope of the present 
invention is also intended at least to include cellulase enzymes 
producible by other Bacillus lautus strains than NCIMB 40250. 

10 The term "derivative" is intended to indicate a protein which is 
derived from the native protein by addition of one or more amino 
acids to either or both the C- and N-terminal end of the the 
native protein/ substitution of one or more amino acids at one 
or a number of different sites in the native amino acid 

15 sequence , deletion of one or more amino acids at either or both 
ends of the native protein or at one or more sites in the amino 
acid sequence, or insertion of one or more amino acids at one or 
more sites in the native amino acid sequence. 

20 Although the enzyme of the invention may be produced by 
cultivating the Bacillus spp. strain NCIMB 40250 or a related 
strain and isolating the enzyme from the culture, it will 
generally be more advantageous to produce the enzyme by 
recombinant DNA techniques which make it possible to optimize 

25 the yield of the enzyme produced. Furthermore, cloned genes | 
encoding the enzymes may be modified in order to provide enzymes 
with improved properties. 

Thus, in another aspect, the present invention relates to a DNA 
30 construct which comprises a DNA sequence encoding an enzyme 
exhibiting cellulase activity, which enzyme is derivable from a 
strain of Bacillus spp. , NCIMB 4 0250, or a related Bacillus spp. 
strain, or a derivative of said cellulase. The invention further 
relates to an expression vector which carries an inserted DNA 
35 construct as indicated above, as well as to a cell transformed 
with the DNA construct or with the vector. 



i 
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In a still further aspect, the invention relates to a 
cellulolytic agent capable of degrading amorphous regions of 
cellulose fibres, the agent comprising an enzyme exhibiting 
cellulase activity as defined above. 

The invention also relates to a detergent composition comprising 
the cellulolytic agent. The cellulase enzyme of the invention 
has surprisingly been found to be more stable during washing 
(for 60 minutes at 40*C) in the presence of conventional 
detergents than a commercial cellulase preparation (Celluzyme™, 
a cellulase preparation isolated from "' Humicol a insolensj 
available from' Novo Nordisk, A/S) . The cause of the increased 
stability may reside in the alkalophilic nature of the enzyme 
(see example 5 below) . It is further speculated that it may also 
be ascribed to stability towards oxidation or towards the 
proteases commonly included in detergents. If so, the celluase 
enzyme of the invention may also show increased storage 
stability in liquid detergents containing proteases. 

20 DETAILED DiS CLOSURE OF THE INVENTION 

The cellulase enzyme of the present invention is preferably one 
which exhibits endoglucanase activity (referred to in the 
following as an endoglucanase) , in particular one which exhibits 

25 an endoglucanase activity of at least about 10, more preferably 
at least about 20, most preferably at least about 25, such as 
about 30, CMC-endoase units per mg of total protein under 
alkaline conditions. The endoglucanase activity is determined as 
the viscosity decrease of a solution of carboxymethyl cellulose 

0 (CMC) after incubation with the enzyme of the present invention 
under the following conditions: 

A substrate solution is prepared, containing 35 g/1 CMC (Her- 
cules 7 LFD) in 0.1 M tris buffer at pH 9.0. The enzyme sample 
5 to be analyzed is dissolved in the same buffer. 

10 ml substrate solution and 0.5 ml enzyme solution are mixed 
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and transferred to a viscosimeter (e.g. Haake VT 181, NV sensor 
181 rpm) , thennostated at 40°C. [\ 

Viscosity readings are taken as soon as possible after mixing" 
5 and again 3 0 minutes later . The amount of enzyme that reduces • 
' the viscosity by one half under these conditions is defined as" 
1 CMC-endoase unit. J 

It should be noted that the endoglucanase of the invention is^ 
10 one which is active (in terms of CMC-endoase activity) underl 
-alkaline conditions. More specifically, the endoglucanase is one^; 
; - which has a pH optimum at a pH of about 7*5-10.5. Contrary t^ 
several known cellulases which are active at an acid pH and} 
/relatively inactive at alkaline pH values, this characteristic^ 
15 makes the endoglucanase of the invention particularly useful fori 
washing purposes, in particular as an ingredient of a detergent 
. . composition, as washing of clothes is typically conducted underi 
alkaline conditions due to the alkalinity of most washing 
detergents. Alkalophilic cellulases are known, e.g. from EP 27l| 
20 004, but they are not indicated to have a high affinity for^ 
cellulose, which is the case with the cellulase enzyme of the| 
viu present invention which also exhibits a higher specific! 
r activity. £ 



r. 



25 The enzyme of the present invention is preferably one which is! 
active at the temperatures at which clothes are typically^ 
washed, which is usually a temperature of up to about 60'C. 
Thus, the native enzyme isolated from strain NCIMB 40250 is^ 
active at temperatures between about 45 and 65 *C. This, however,^ 

30 does not preclude the possibility that the enzyme may, under 

certain conditions, be active at temperatures above 65 °C. 

J 

One enzyme according to the invention is an endoglucanase with" i [ 
an apparent molecular weight of 75 kD or a cleavage product j 
5 thereof exhibiting endoglucanase activity. The term "cleavage I 
product" is intended to indicate a shorter form of the enzyme 
resulting from, for instance, chemical or enzymatic cleavage 
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(e.g. by Means of a suitable protease) recovery of 

enzyme or from posttranslational processing by the „ ° „1 
produce the enzyme, e . g . „_ c . te ^ in y al ^J^"" 

wh 1C h may give rise to a mature form of the enzyme. Hpec^ 

sTkd BrooT f PrSSent PUIP ° Se ±S 8 Pr ° dUCt ° f *PP«ximately 
DNA T M CUltlvatin ' * organis, transformed with 

DNA encodxn, the -75 *D enzyme. The -75 *D enzyme (and its "58 
XD cleavage product) are referred to in th. f „i, 
10 as Sjjdoi. he foll °wi"g examples 

T h l ° f inVEntiOT be a " *»^™e encoded by 

the DNA sequence shown in the appended Sequence Listing lo.f 
(showing the sequence of the -75 XD enzyme) or a modit I 
- thereof encoding a derivative cf said enlogl'uclase 3tl ° n 

jf axiymg the DNA sequence coding for the na H vo 

2 » ^te 9 raTnucTTr ° f SUitablS of the b»a 

equence are nucleotide substitutions which do not give rise *„ 

another amino acid sequence of the endoglucanase, t ^ 
=or r esp ond to the Godon usage ^ ^ ■ *^ ™y 

<x * a dl ««ent amino acid sequence and 

•5 therefore possibly, a different polypeptide structurT witLut 

exTie": T iring toe properties °* <- 

examples of possible modifications are insertion of one or morl 
nucleotides into the sequence, addition of one or more 
nucleotides at either end of the sequence and deletion of one or 

:rfi n T° ti<,eS "* Either »* ~ ^ «- sequence, such 

LI „L c T ons of DKA coding for natlve proteins «• *>°™ 

and widely practised in the art. 

Another enzyme according to the invention is an endoglucanase 
vith an apparent molecular weight of 56 or a cleavage product 
thereof (as defined above, exhibiting endoglucanase activity 1^ 
example 2 below, this enzyme is referred to as En do2 
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The enzyme of the invention may be an endoglucanase encoded by 
the DNA sequence shown in the appended Sequence Listing ID#3 
(showing the sequence of the -56 kD enzyme), or a modification 
thereof (as defined above) ehcpding a derivative (as defined 
above) of said endoglucanase. 

A further enzyme according to the invention is an endoglucanase 
with an apparent molecular weight of 45 kD or a cleavage product 
thereof (as defined above) exhibiting endoglucanase activity . A 
specific example of such a cleavage product : is a protein of 
approximately 3 0 kD produced on cultivating a host organism 
transformed with DNA encoding the -45 kD enzyme. In example 3 
below, the -45 kD enzyme is referred to as Endo3A . 



The enzyme of the invention may be an endoglucanase encoded by 
the DNA sequence shown in the appended Sequence Listing ID#5 
(showing the DNA sequence encoding the -45 kD product), or a 
20" modification thereof (as defined above) encoding a derivative 
- (as defined above) of said endoglucanase. 



Other enzymes exhibiting endoglucanase activity produced from 
endoglucanase clone 3 (cf. example 3 below) are proteins of 
25 approximately 60 and 56 kD, referred to as Endo3B and Endo3C 
respectively. 



A still further enzyme according to the invention is an 
endoglucanase with an apparent molecular weight of 92 kD or a 

30 cleavage product thereof (as defined above) exhibiting 
endoglucanase activity. In example 4 below, this enzyme is 
referred to as Endo4. Other enzymes exhibiting endoglucanase 
activity produced from endoglucanase clone 4 (cf. example 4 
below) are proteins of approximately 74 and 71 kD. Either ot 

35 both of these may be individual enzymes or cleavage products of 
the -92 kD enzyme. 



r ■ 5 
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whose ^ nctl is to medlate bi ^ lng to cel z 0 :;:\: b *z?s 

U-e the cellulose-binding domain; thls corresponds If 
(Nakamura et al iqo,, „ ' aciI3,1s - gubtx] is 

; — or ^ ; D Z 7 ; s po a r ::r le ; the full - lensth - 7s *° 

, cellulose-binding JLT+LTZ ^LT ' rT^ 1 
off proteolyticallv leaving . core ^JT^.^TT 

u Presence o, the cellulose-binding dcain has been LnT'to be 
x»porta„t f or obtaining a colour clarification effect 

, prevashed textiles (cf . example 6 below, . " 

. cor region d^ 'CT ^J^JTT^ 

invenf derxved from an endoglucanase of the present 

:::::°f;r a a vzzzr tha — - - 

irom a celiuiase enzyme which does not, i n nature 

=: -^tt^^t- ana which *• 

a cel -Lulose-bxndxng domain derivpH -p*-™- 
endoolucanaco 4-u aerxved from an 

naogxucanase of the present invention. In this wav „ w 

possible to construct cellui a « * y be 

properties. <~llulase enzymes with improved binding 

0 

The DNA construct of the invention May be one which comprises a 

5 DreDarinfl a . NA or Wn, for xnstance obtained by 

preparxng a genomxc or cDNA n ~« 

SPP- strain ( e.g. ^^^^"^^J^ 
screening for DNA sequences coding for al^or pa^ of' £ 
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appropriate cellulase by hybridization using synthetic 
, oligonucleotide probes in accordanc with standard techniques 
(cf. Maniatis et al., 1982), 

5 The expression vector of the invention carrying the inserted DNA 
construct encoding the enzyme of the invention may be any vector 
which is capable of replicating autonomously in a given host 
organism , typically a plasmid or bacteriophage. In the vector, 
. the DNA sequence encoding the enzyme of the invention should be 
10 operably connected to a suitable promoter sequence. The promoter 
may be any DNA sequence which shows transcriptional activity in 
:: the host cell and may be derived from genes encoding proteins 
either homologous or heterologous to the host organism. The 
promoter is preferably derived from a gene encoding a protein 
15 homologous to the host organism. Examples of suitable promoters 
are lac of E.coli . daaA of Streptomyces coelicolor and amy L of 
Bacillus licheniformis. 



20 



The expression vector of the invention further comprises a DNA 
sequence enabling the vector to replicate in the host cell. 
- Examples of such sequences are the origins of replication of 
plasmids pUC19, pACYC177, pUBHO and pIJ702. 

The expression vector may further comprise a DNA sequence coding 
25 for a signal peptide in order to provide extracellular 
expression of the enzyme. The DNA sequence may, for instance, be 
the signal sequence from the a-amylase gene of JB^_ licheniformis . 

The vector may also comprise a selectable marker, e.g. a gene 
30 whose product confers antibiotic resistance, such as ampicillin, 
chloramphenicol or tetracycline resistance, or the dal genes 
from fi-suhti n « or B. licheniformis . 

The procedures used to ligate the DNA sequences coding for the 
35 enzyme of the invention and the promoter, respectively, and to 
insert them into suitable vectors containing the information . 
necessary for replication in the host cell, are well known to 
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persons skilled in the art (cf., for instance, Maniatis et al., 
op.cit. ) . 

The host cell of the present invention may be transformed with 
5 the DNA construct of the invention encoding the cellulase enzyme 
described above. In this case, the DNA construct may 
conveniently be integrated in the host chromosome which may be 
an advantage as the DNA sequence coding for the cellulase is 
more likely to be stably maintained in the cell. Integration of 
10 the DNA constructs into the host chromosome may be performed 
according to conventional methods, e.g. by homologous 
recombination . 

Alternatively, the host cell may be transformed with an 
15 expression vector as described above. 

The host cell used in the process of the invention may be any 
suitable bacterium which, on cultivation, produces large amounts 
of the enzyme of the invention. Examples of suitable bacteria 
20 may be grampositive bacteria such as bacteria belonging to the 
genus Bacillus, e.g. Bacillus subtilis . Bacillus lichen if o-rmi a r 
Bacillus lentus, Bacillus brevis r Bacillus stearothermophilus . 
Bacillus alkalophilus r Bacillus amvloliauef aciens . Bacillus 
coagulans, Bacillus circulans or Bacillus lautus r or 
25 gramnegative bacteria such as Escherichia coli . The 
transformation of the bacteria may for instance be effected by 
protoplast transformation or by using competent cells in a 
manner known per se . 

30 In a yet further aspect, the present invention relates to a 
method of producing a cellulase enzyme of the invention, which 
method comprises cultivating a host cell as described above 
under conditions conducive to the production of the cellulase or 
derivative thereof and recovering the cellulase or derivative 
5 thereof from the cells and/or culture medium. 

The medium used to cultivate the cells may be any conventional 
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medium suitable for growing bacteria. The cellulase may fc e 
recovered from the medium by conventional procedures including 
separating the cells from the medium by centrifugation or 
filtration, if necessary after disruption of the cells 
precipitating the proteinaceous: components of the supernatant or 
filtrate by means of a salt, e.g. ammonium sulphate, followed by 
purification by a variety of chromatographic procedures, e.g. 
ion exchange chromatography, affinity chromatography, or the 
like. 



In a particular embodiment of the method of the invention, the 
cellulase is recovered in mature form, either as a result of 
posttranslational processing of a proenzyme as explained above 
or as a result of appropriate modifications of the DNA sequence 
15 encoding the enzyme in the form of deletions of DNA 
corresponding to truncations in the N- and/or C-terminal: 
sequences of the enzyme. 



There is reason to believe that different cellulases may exert 
a synergistic effect with respect to the degradation of 
v . cellulose. The cellulolytic agent of the invention may therefore^ 
advantageously comprise a combination of two or more cellulase^ 
enzymes of the invention or a combination of one or more 
cellulase enzymes of the invention with one or more other 
25 enzymes exhibiting cellulase activity. Such cellulases may be 
endocellulases or exocellulases dependent on the intended use of 
the cellulolytic agent (e.g. the degree of cellulose degradation 
aimed at) . The other cellulases may be selected from those which 
may be isolated from species of Humicola such as Humicola 
30 insolens (e.g. strain DSM 1800) , Fusarium such as FusariuB 
oxysporum (e.g. strain DSM 2672), Mvceliopthora such as 
MvceliopthoT-a thermophile r Erwinia such as Erwini a j 
chrvsanthermis (cf. M.H. Boyer et al., Eur. J. Biochem, 16£r 
1987, pp. 311-316), Trichoderma such as Trichoderma reseej / 
35 Microbispora such as Microbispora bispora . Neocallimastix such | 
as Neocall imastix frontalis , Piromonas such as Piromoim g 
communis, Robillarda spp., Cellulomonas such as Cellulomona g 
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fimi, Clostridium such as Clostridia themo^nn. , Pseudn.n^c 
spp. , Thennonospora spp. , Bactericide s PP . or Ruminocn^ spp . 

The cellulolytic agent of the invention may suitably be in the 
5 form of a non-dusting granulate, stabilized liquid or protected 
enzyme. Non-dusting granulates may be produced e.g. according to 
US 4,106,991 and 4,661,452 (both to Novo Industri A/S) and may 
optionally be coated by methods known in the art. Liquid enzyme 
preparations may, for instance, be stabilized by adding a polyol 
10 such as propylene glycol, a sugar or sugar alcohol, lactic acid 
or boric acid according to established methods, other enzyme 
stabilizers are well known in the art. Protected enzymes may be 
prepared according to the method disclosed in EP 238,216. 

15 The cellulolytic agent may suitably exhibit an endoglucanase 
actxvity of 500-10,000 CMC-endoase units (as defined above) per 
gram of the agent. The cellulolytic agent is suitably a 
detergent additive which may comprise one or more other 
enzymes, such as a protease, lipase and/or amylase, 

-0 conventionally included in detergent additives. 

The detergent composition of the invention comprising the 
cellulolytic agent described above additionally comprises one or 
more surfactants which may be of the anionic, non-ionic, 
cataonxc, amphoteric, or zwitterionic type as well as mixtures 
of these surfactant classes. Typical examples of anionic 
surfactants are linear alkyl benzene sulfonates (LAS), alpha ' 
olefin sulfonates (AOS) , alcohol ethoxy sulfates (AES) and 
alkali metal salts of natural fatty acids. 

) 

The detergent composition of the invention may contain other 
detergent ingredients known in the art as e.g. builders 
bleaching agents, bleach activators, anti-corrosion agents,' 
sequestering agents, anti soil-redeposition agents, perfumes, 
enzyme stabilizers, etc. 

The detergent composition of the invention may be formulated in 



WO 91/10732 



PCT/DK91/00013 



14 

any convenient form, e.g. as a powder or liquid. The enzyme 'm^y 
if required, be stabilized in a liquid detergent by inclusion of 
enzyme stabilizers as indicated above. Usually, the pH of a 
solution of the detergent composition of the invention will be 
5 7-12 and in some instances 7.0-10.5. Other detergent enzymes 
such as proteases, lipases or amylases may be included in the 
detergent composition of the invention, either separately or ia 
a combined additive as described above . 

10 The softening, soil removal and colour clarification effect^ 
obtainable by means of the cellulase enzyme ...of the invention" 
generally require a concentration of the cellulase in the" 
washing solution corresponding to an endoglucanase activity of 
5-200 CMC-endoase units per liter. The detergent composition 

15 . _ of the invention is typically employed in concentrations of o.sf 
. /- 20 g/1 in the washing solution. Consequently, the cellulase'! 
concentration of the detergent composition of the invention is ' 
about 0.3 - 400 CMC-endoase units per gram. In general, it ism 
most convenient to add the detergent additive in amounts of 0^1 

20 - 5 % w/w or, preferably, in amounts of 0.2 - 2 % of th|& 
detergent composition. For special applications, however, fo|{ 
instance when the detergent composition is to be used for colour! 

clarification or harsness reduction of fabric which has been! 

M 

damaged by repeated washing, it may be convenient to include |f 
15 much larger amount of the cellulolytic agent, such as about 2pjH 
w/w. . : $m. 

In a still further aspect, the present invention relates toM 
method of reducing the rate at which cellulose-containii??! 

0 fabrics become harsh or of reducing the harshness of cellulosejj 
containing fabrics, the method comprising treating a cellulose| 
containing fabric with a cellulolytic agent as described above| 
The method of the invention may be carried out by treating 
cellulose-containing fabrics during washing. The cellulolyti| 

5 agent may either be added as such in the amount required t& 
obtained the desired effect, or it may be added as an ingredient 
of a detergent composition as described above. However, & 
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desired, treatment of the fabrics may also be carried „ • 
soaring or rinsing or simply by ^ ^ J^^^ 
water xn which the fabrics are or will be immersed " ° 

The cellulolytic agent of the invention may also be emm , 
obtain colour cl a rif^ a f ~ employed to 

clarification of cellulose-containing fabric 
After repeated washing, such fabrics often develop 
appearance. This effect is particularly evidenl wlth T* 
fabrics, especiallv dar-v ^ • evident with coloured 

^ undyed parts or t^e c f T^' ^ ^ 

the ceL^^ It^Z ^ S beC °^—t when 

by mechanical forces The d , " m da ^ ed 

rorces. The damaged parts of the -FiW^~ 

below, . ""uiose-binaina domain ( cf . exa-ple 6 

by treafin^ ■• , invention may be carried out 

«y treating cellulose-containiner f.K^v, • 

r n !rng . celIulo - — e—rrr 

such ln the M|olmt required to obtalnea 

ae S : r a L: t ed as an ln9reaient ° f a aet — ~»po"T m 

*ay Lso V . H ° WeVer ' " deSlred ' *™*— t °f the fabrics 

TdLf 1 Carried ° Ut dUr1 " 3 SMkin ' ° r rinsing °r si»p ly by 

^^Lr^tS :cr t clarl " cation p ™- 

norB Y suirabl y exhibit an endoglucanase activity of 

LTL ^ mC - end — Per liter ot the agueous 

» M y furthermore be possi ble to e* pl o y a ceHulolytic agent 
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according to the invention to provide a localized variation i n 
the colour of a fabric to impart a "stone-washed" appearance t 0 
the fabrics (for the use generally of cellulase enzymes for thi s 
purpose, see for instance EP 307 564) . 

The cellulolytic- agent of the Invention is also contemplated to 
be useful in the field of paper pulp processing , e.g. p m p 
drainage .(for the use generally of cellulase enzymes for this 
purpose, see for instance EP 262 040) , as well as for deinking 
of paper intended for recycling (for the use generally 0 f 
cellulase enzymes for this purpose, see for instance JP 59-9299 
or JP 63-59494). 

The invention is further illustrated by the following examples 
15- which are not in any way intended to limit the scope of the] 
invention, with reference to the appended drawings, wherein ^ 

Fig. 1 Restriction maps of the Bacillus spp. DNA inse 
contained on the plasmids in the 4 endoglucanase- 

2 9 ; positive E.coli clones. The position and direction of 
. - : . transcription of endoglucanase l (Endol, pPL517) and 

endoglucanase 2 f Endo2 . pPL382) is indicated. Restri 
tion enzyme sites are indicated as follows: PstI (P^ 
Hindlll (H) , Sroal. (S) , Sail (Sa) , BamHI (Ba) , Balflj 
25 (B) , SphI (Sp) , EcoRI (E) . n indicates pBR322 DNA 

Fig. 2 Restriction maps of different plasmids carrying tW 
Endol gene f l p . The ^-lactamase promoter of pBR32i 
is indicated by the arrow (Pb) and the direction of ti* 

30 transcription of the Endol gene is indicated by t3*M 31 
arrow (Pb) . (■■—■) : pBR322, (■■): "tail" of pBR3j 
encoded amino acids. Restriction enzyme sites a* 
indicated as follows: PstI (P) , Ball! (B) , EcoRI , (E| 
Clal (C) . The endglucanase activity in extracts ° 

35 E.coli MCI 000 containing the indicated plasmids 

shown to the right as (A) cellulase units/ml cult^ 
medium. 
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Fig. 3 Effect of temperature on the cellulase (Endol) in 
extracts of E.coli MC1000 ( P PL517) and on the multiple 
cellulase activities in the supernatant of Bacillus 
sep_. PL236. The activity was measured at the tempera- 
ture indicated after an incubation period of 30 min. 
See Materials and Methods. The activity at the dif- 
ferent temperatures is presented as cellulase units/ml 
of the original culture volume. 

Fig. 4 Relevant restriction sites and sequencing strategy of 
the- endoglucanase Endol indicated as endocellulase l 

in the figure. (J * ): extent and direction of 

sequence reactions. Abbreviations:- EcoRi ( E) , Pst i 
(P), Bglll ( B g), BstNI (Bs) , Aval (A) , Clal (C) ,' BamHI 
(Ba). 

Fig. 5 is a restriction map of plasmid pDN 2801. Restriction 
enzyme sites are indicated as follows: EcoRI(E), Bglll 
TBg), Hindlll (H), Smal (Sm) , Sail (sa) , sphl (Sp) 
Psti (P )f Eagi (Ea), Clal (C) , BamHI (B) . CAT indicates 
the gene mediating chloramphenicol resistence. P 
indicates the Bacillus maltogenic a-amylase promoter ." 

25 Fig. 6 Restriction maps of different plasmids (B.sub£ilis 
replication origin) carrying the Endol gene. The 
maltogenic alpha-amylase promoter is indicated by the 
arrow (P m ) , which also indicates the direction of the 
transcription of the gndol gene; (mm): PDN2801, 
(■): tail of PDN2801 encoded sequences. Restriction 
enzyme sites are indicated as follows: Eagi (Ea) , Bqlii 
(Bg), Psti (P), BamHI (B).The endocellulase activity 
in extracts of B. subtil i s DN 1815 containing the 
indicated plasmids is shown to the right (A) as 
endoclucanase units/ml culture medium. 
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Fig. 7a Construction of pTL05 and pLA03. ( 0 ) : represents tfc 
and b C-terminal "tails 11 encoded by vector sequences. 

indicates the expected C-terminal cleavage site- 
Restriction sites are abbreviated as follows: PstI (pj : 
5 Pvu (Pv) , BamHI (B) , Balll (Bg) , Sail (S) . 'J. 

Fig. 8a Construction of pCH57. ( ^VVsJ ) : Signal sequence of the 
and b alpha-amylase. ( jltMUHl) : signal sequence of the Endoi 
glucanase ( BBi )• "direct repeat" e.g. start of tte 
10 mature Endol gene. ( • ) ribosome binding site of th| 

: alpha-amylase. (O) ribosome binding site of the Endo 
glucanase (M^) : alpha-amylase promoter. Restriction 
sites are abbreviated as follows: PstI (P) ; sail (S)) 
Konl (K) ; Eaal (Ea) ; Balll (Bg) . , , j 

15 

Fig. 9 Relevant restriction sites and sequencing strategy of 
the endoglucanase Endo2 (indicated as endocellulase 2 

. > t in the figure). ( 1 ^ ): extent and direction of 

sequence reactions. Abbreviations: Seal (Sc) , Saci 
20 , CSa), Xmnl (X), Hindi! (He), Hindlll (H) , PstI (P) .cf 

. Fig- 10 "Zymogram showing the molecular weight of the activi] 
: proteins resulting from different plasmids carrying the 
original endoglucanase 3 clone (pPL591) as well as] 
25 deletions in the original insert. 4 

Fig. 11 Restriction maps of the endoglucanase 3 clone (iii: 
dicated in the figure as endocellulase 3) (pPL591) anij 
plasmids containing deletions in the original insert^ 
30 The position of the endoglucanase genes Endo3A , EndfiS? 

and Endo3C r on the insert in plasmid pPL591 predicted 
from the data shown in the zymogram (Fig. 10) is show? 
on the restriction map. Restriction enzyme sites ar* 
indicated as follows: Hindlll (H) , Smal (S) , EcoRI (E)*j 

35 
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EXAMPLES 

MATERIALS AND MTr-r^po 



10 



a) Bacterial fitr^n S ar ,d p i acTO ^ g i 

The donor strain, Bacillus ^ strain PL236, was isolated from 
a compost sample from Lyngby, Denmark, on the basis of its hiah 
cellulolytic activity. A sample of this strain was deposited In 
18th January, 19 90 in the National Collection of Industrial and 
Marxne Bacteria, 23 St. Machar Drive, Aberdeen, Scotland, with 
the accession No. NCIMB 40250. 



The following E. coll strains and plasmid were used: MCiooo 
■ (arapjj391, (ara, leu) 7697, lacX74, salu, galK, rpsL) (casaba- 
15 dan et al., 1980) ; CSR603 (E-, thr^l, ^uB6 f ^ ^ 
B-Al, argE3, thi^l, uvrA6, lam# ^ ^ 3 ~j 

£PsL3l, tsx^33 # -, suEMl) (Sancar et al., 1979); PL248~7s" 
MC1000 containing the plasmid pNF2690 which contains the 
replxcatxon origin and the kanamycin resistance gene from 
^0 PACYC177 (Chang and Cohen, 1978) and the cI857 repressor genl 
from the coliphage lambda; p B R322 (Bolivar, 1977); pUN l 2 i 
(Nxlsson et al., 1983 , pUC i 8 (Yanisch-Perron et al., 1985)- 
PPLC28 (Remaut et al., 1981); p PL 170 (Jorgensen, P.L., 1983);' 

5 For the experiments with B.. sub±ilis, the following S . subtilis 
straxns.and plasmids were used: DN1885 (amyg, amyR, s^g!, pro!) 
(Dxderichsen, Novo Industri A/S) is a derivative of b. subtil 
168; PL1801 is a derivative of DN1885 lacking the two main 
exoproteases (apr, npr, ; pD N2801 has the origin of replication 
from pUBlio (Keggins et al., 1987) , the Cat gene of P ci94 
(Horxnouchi and Weisblum, 1982) and the maltogenic alpha- 
amylase promoter (Pm) from B. stea^hp^^,. (Diderichsen 
and Chrxstiansen, 1988) followed by a polylinker; The £. subtil- 
As/E.coli shuttle vector pJKK3-l is described by Kreft et al 
(1983); pPL1759 has the origin and kanamycin resistance gene of 
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pUBHO and the promotor, ribosome binding site and signal- 
sequence from the alpha-amylase gene from B. licheni-Ff^ ^l 
(Stephens et al., 1984). 

b) Media 

Phosphoric acid swollen cellulose (ASC) was prepared froa^ 
chromatography cellulose (MN 300, Machery, Nagel) as describeil 
by Walseth (1952) with the exception that the cellulose povdell 
was suspended in acetone before treatment with phosphoric acid§ 
The medium for detection of cellulase activity was prepared a§ 
standard m) -medium (Maniatis et al., 1982) containing 0.2% AScI 
or microcrystalline cellulose . (Avicel , Merck) 

15 Bacillus spp. , B. subtil is arid E.coli cells were grown in N*§ 
medium (von Meyenburg et al. f 1982), LB medium (Maniatis et aYl% 
1982) , or BPX medium (100 g/1 potato starch, 50 g/1 barlejf 



flour, 0.1 g/1 BAN 5000 SKB, 10 g/1 sodium caseinate, 20 g/ 
soybean meal; 9 g/1 Na 2 HP0 4 , 12H 2 0, 0.1 g/1 Pluronic) . 

The media were solidified by the addition of agar (20 g/1) 



m 



Tetracycline (20/ig/ml) , or kanamycin (10 pg/ml) were added as§ 
required. 



c) Isola tion of DNA i| 
To isolate the chromosomal DNA from Bacillus spp . PL236, cells* 
from 250 ml overnight culture were resuspended in 10 ml (50 nH| 
Tris-HCl, pH 8.0, 100 mM EDTA) , and incubated with 25 ig* 
lysozyme for 20 min. at 3TC. To the mixture was added 2 ml of| 
10% (w/v) SDS, mixed and put on ice for 10 min. To the solution! 
was then added 15 ml of phenol saturated with TE-buf f er ( 10 mlf| 
Tris-HCl, pH 8.0, 1 mM EDTA), heated to 65 *C, mixed gently andj- 
cooled on ice. After centrifugation for 30 min. at 40000 g the^ 
aqueous phase was ether extracted, ethanol precipitated and tbe| 



REFLACEMENTSHEET 



WO 91/10732 



PCT/DK91/000I3 



■fv' 



10 



21 

pellet was resuspended in TE-buffer. The DNA was further 
1982) fled ^ band±ng ^ 3 CSC1 densit y gradient (Maniatis et al., 

E,coli plasmid DNA was prepared by the SDS lysis method 
(Maniatis et al., 1982) ; ^preparations of plasmid DNA for 
restriction enzyme analysis and transformations were prepared 
according to Holmes and Quigley (1981) . fi.s^jbfcilis plasmid DNA 
was prepared by the alkaline lysis method. (Maniatis et al 
1982). * ' 



15 



20 



15 



, d) Cloning of chromosomal DNA from B ac mn g ~ r r PTi ^ fi , n ^ \, 

col J. ' - 

Ena^rt 0 ",^ 21 ™" " li9aSS " ere obt ^ned fro* New 

England Bxolabs and used as described by the manufacturers. 

toe e n„f 9eSt t° n Witt ££ "' 82261 " restriction ^ 

the DNA was heated to 65'c for 10 min. and ethanol precipita- 
ted. 10 „ of linearized P BR322, pJXKa-i or pDH121 ancJ 20 of 
fragmented £. sep. chromosomal DNA were ligated with 5 units of 
DNA Ugase in a final volume of loo „1 (16 h, 15-C). The ligated 
DNA was used to transform competent E..^ MC1000 to 

BW S ™ " " Elstan « — entiai ly as described by Handel and 
Bxga (1970). (PBR322: 20 „ g/ i, P OT12l: 7 ^g/i and pjKK3-l: io 
/ig/ml) . 

, e) Detection of cenmase- POS1 >W 0 r ; . co]l . ff1 wp „ ■ 

usL deteCti r. ° f CellUlaS6 aCtivit ^ - Pl-t« was performed 
using a modification of the technique of Teather and Wood 
(1982) . E.coH clones were grown overnight on solid NY medium at 
37 -c. Cells were lysed by overlaying the colonies by topagar 
containing phosphate buffer (100 mM, pH 7.0) , agar (0.7%, CMC 
(0.2%, , SDS (0.25 mg/ml), an d chloramphenicol (200 /i/al) and 
subsequent incubation overnight at 37 "C. Plates were then 
flooded with an aqueous solution of Congo red (1 mg/ml) for 15 
axn. and subsequently washed with 1 M NaCl. Cellulase-positive 
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colonies were surrounded by a yellow halo on a red background 
f) Maxicells % 

Plasmid-encoded proteins were analysed using the maxicell metti^ 
of Sancar et al. (1979) with the following modification. After? 
UV irradiation, the surviving cells were killed by incubatin^ 
the cells with D-cycloserine (150 fig/ml) for 48 h at 37 *c. ~'M 
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a) " Gel electrophores j s "7f| 
( 35 S)-L. methionine labelled maxicell proteins and other proteins^ 
were analysed by electrophoresis on 15% (0.075 per cex^j 
bisacrylamide) SDS-polyacrylamide gels (Laemmli, 1970) . Proteins 
were visualized" either by staining with Coomassie Blue G 250 oil 
by autoradiography. 



■2$r 



Analysis of DNA was done by electrophoresis on agarose gels witfif 
the buffer described by Loening (1967) . 



ft) Detection of cellulas e activity in polyacrvlami de gels 
... Detection ^ of - cellulase activity in protein bands separated byj 
SDS-polyacrylamide gel electrophoresis was done by a modified 
zymogram technique described by Beguin (1983) . Protein prepara 
tioris were electrophoresed on a SDS-polyacrylamide gel a: 
described above and the gel was washed 3 times 30 min. 
phosphate buffer (100 mM, pH 7.0) layered on to a thin (0.8 mnj 
agarose gel (agarose, (1.8 per cent), CMC, (0.2 per cent)! 
phosphate buffer, (100 mM, pH 7.0)), and incubated for 4 h aC, 
42 «C. Cellulase activity was visualized by staining the agarose| 
gel for 30 min. in an aqueous solution of Congo red (l mg/nil)| 

followed by washing the gel in 1 M NaCl. -,| 

"Mil 

m 

i) Colo rimetric cellulase assay 
Cellulase activity in cell extracts was analysed by measuring! 
the increase in reducing groups released by the hydrolysis °*J 
CMC (Miller, 1959) . An appropriate amount of enzyme was i n f§| 
cubated with 1.5 ml of 1 per cent CMC in phosphate buffer (1^1 



10 



15 



20 



25 



30 



35 



REPLACEMENTSHEET 



WO 91/10732 



PCT/DK91/0O013 



23 



10 



15 



20 



5 



mM, pH 7.0). After 30 min. of incubation at 55*C, 1.5 m i of 
dinitrosalicyclic acid reagent was added and the samples were 
boiled for 5 min. The absorbance was read at 550 nm against 
blanks containing equivalent amounts of extract from the E.cpli 
recipient strain. One unit of cellulase released 1 nM of glucose 
equivalents per second by reference to a standard curve. 

j ) DNA-Seauenm' p g 

Single end labelled DNA fragments were isolated and sequenced by 
the chemical modification method (Maxam and Gilbert, 1980) . The 
cleavage products were separated on 8% or 20% polyacryl-amide 
gels and thereafter autoradiographed at -70 'C using intensi- 
fying screens. 

The dideoxyribonucleotide method of Sanger et al., (1977) was 
used for the sequencing of Endo3A using derivatives of pUC18 
(Yanisch-Perron, 1985) . 

£J Southern' analy sis 

Chromosomal DNA from Bacillus spjK (PL236) was digested with 
restriction enzymes as required and fractionated on 1% (w/v) 
agarose gels. DNA was then blotted onto nitrocellulose filters. 
32 p -labelled DNA probes (recombinant plasmids) were prepared by 
nick translation (Rigby et al., 1977) using 32^ (Amersham) 
and hybridization was carried out as described by Southern 
(1975). Autoradiography was performed at -70 -C using intensi- 
fying screens. 

X) Transformation of co m petent n. subtil is cells 
A modified version of Dubnau and Davidof f-Adelson • s (1971) 
procedure for preparing competent cells of B. subtil is is used. 
10 ml of LB-medium is inoculated with the strain in the morning. 
7 hours later sequential dilutions in KM-l-medium are made and 
incubated overnight at 37 - C . The following morning, the second- 
most diluted and growing culture is diluted ten times in KM-2- 
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medium. The cells are harvested after 45-60 minutes of 
incubation by centrifuging for 3 minutes at 7K. They ate 
resuspended in 1/10 volume of the supernatant and 1/50 volume of 
86% glycerol is added. 0.1 ml amounts are frozen on liquid 
nitrogen and stored at -80'C. ' 

In order to transform these competent cells, the method 0 f 
Ehrlich (1986) is used, with some modifications. BTF i s 
prepared and preheated to 42 'C. 0.01 ml of DNA is placed in a i 
reaction vessel, and the competent cells are thawed at 42 *C. BTF? 
is added to the cells at a ratio of 1:1; and 0.1 ml of th3 
mixture is added to the DNA. The cells and DNA incubate with! 
shaking for 20 minutes at 37 'C. A further 30 minutes of genlfj 
expression with 0.1 ml of NY-medium is needed, if kanamycinf 
resistance is desired. The cells are finally spread on relevanf 
plates. Recipes for stock-solutions are as follows: T 
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Stock solutions for making B.subtilis competent cells 
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Salt mix: 



10 mM CaCl 2 , ImM FeCl 3 and ImM MnClil 



25 



30 



35 



10 X MM : 



KM-stock : 



KMl-stock : 



KM2-stock: 



20 g (NH 4 ) 2 S0 4 , 60 g 
K2HP<V3H 2 0 in 1L. 



KH 2 P0 4 , 



140= 



SJSl 



-8 

10% Na-s 



100 ml 10 X MM, 10 ml 
citrate, 2 ml 1M MgS0 4 in 1L. 

960 ml KM-stock, 20 ml 20% glucosff 
1 ml 20% casamino acids, 20 ml 
yeast extract, 30 1 ImM MnCl 2 in 

960 ml KM-stock, 20 ml 20% glucos 
1 ml 20% casamino acids, 20 ml; 
yeast extract, 1 ml salt mix, 



mm: 



ft' A < 
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BTF : 
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25 

0.5 M CaCl 2 , and 0.8 ml 1M MgCl, ± n 
1L. 2 

100 ml ib x MM, io ml Na-citrate, 2 
ml 1M Mgso 4 , i ml saltmix, 20 ml 20% 
glucose in il. 

800 ml BCG, 100 ml lOmM egta, ioo mM 
Mgci 2 in il. 

(Prepared fresh before use) . 




■« £. it z^z ° oding for en2yraes 

,v cenuiolvtic ba ^!" ar ° Una «*«■»*-. Several 

■ these 1^ ! " ere " 0latea - ° ne **• —t active of 

» -^C' " ^ d ° n0r ** tte = loning 

-xi-jrocrystallme cellulose fAvie^i m«v-„i,\ 

the r>T^,-~ (Avicel, Merck) on agar medium. At 

the optimum temperature of growth (42'C) th. 
0 appeared in 2-3 days. ' clearing zones 

: sioL n ir a11 am ° UntS ° f *»• PL236 to suspen- 

sions of *icrocrystalline cellulose makes the cellulose crys^Ts 
lump together tightly and sediment. crystals 
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This suggests that either the Bacillus spp. PL23 6 cells or the 
extracellular cellulase enzymes have a strong affinity for the 
cellulose substrate and tightly bind the cellulose crystals 
together . 

SCREENIN G ASSAY FOR RECOMBINANT CLONES 

The first attempts to clone cellulases from Bacillus spp . PL236^ 
were directed towards the endocellulases genes from Baci 1 w V 
see. PL236. To facilitate the screening on plates of endocellu4 
lase positive clones, an assay using the dye Congo red (Teathe^l 
and Wood, 1982)- was adapted to E.coli. , 

In order to detect cellulase activity trapped inside th|f 
15 recombinant E.coli cells the cells were lysed by adding SDS tdf 
.the. top agar. This modification of the top agar had no measuref 
able effect on the CMC-degrading enzymes of Bacillus see.pl236 
when the modified assay was used on this organism. 

20 MOLECUIA R CLONTNG OF ENDQCELLUIASES 

Several endocellulase genes were cloned from the cellulolytiff 
Bacillus see, strain (PL236) . An endocellulase 1 clone (PL517J 
was made by ligating Pstl partially digested PL23 6 chromosomal 
25 DNA with PstJ cleaved pBR322 and subsequently transforming 
competent E. coli cells. 

An endocellulase 2 clone (pPL382) was made by ligating Hindllll 
partially digested PL236 chromosomal DNA with Hindlli cleave* 
3 0 pJKK3-l (an E.coli/B. subtilis shuttle-vector) and subseguentll 
transforming competent E. coli cells. 

An endocellulase 3 clone (pPL591) was made by ligating EceBlf 
partially digested PL236 DNA with EcoRI cleaved pUN121 
35 subsequently transforming competent E. coli cells. 
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partially digested PL236 DNA with Hiaa ^ pUNxTTl^ 
subsequently transforming competent E.coli cells 

5 

, Y " th^r tetracycline 

resistance (P BR3 22: 20 g/nl , pDH121: , ^ an<J ^ 

The transformers were replicated to another set of plates and 

ZlllT ^ • thS a ° dlrled ^ tOP ^ «» -~ £ 

cubated overnight and stained with Congo red as described above. 

Under the reisolation procedure it was observed that it was 
possible to detect positive clones without adding the cell 
Ivzing agent SD s to the topagar, although the diameter of the 
halo was considerably smaller . 

mnwnTffli A W p ChTOTrrFFTriTTOK n„ ^ F „ - rT| - rr - r 

and^ \ T Cellula «-P=="ive transformants were isolated 
and analyzed With restriction enzymes. Restriction enzyme maps 

IwnTnTigtT PlaS " idS fr °" f ° Ur ClOTi - e — - « 

To determine the molecular weight of the cloned endoglucanases 
a zymogram technique (Beguin, l 98 3, was used. Total protein 

JIT*??™ frOT representati - endoglucanase clones were 
separated on a SDS-polyacrylamide gel. The proteins in the gel 
were then renatured by washing out the SDS and replicated onto 
an agarose gel containing CMC Renatured proteins diffuse to 

attlvit ^ Pr ° telnS endoglucanase 

were 1 ' T* ^ ^ *" ^ Th * »^-anase bands 
were then Visualised by staining the activity gel with Congo red 
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as described above. 



As appears from Fig. l, the restriction maps of the cloned DNA - 
(as well as the molecular weight of the endoglucanases encoded 
by the cloned DNA) are different ] ±n the four different clones" 
This indicates that these four clones represent at least four 
different endocellulase genes from Bacillus spp . (PL23 6) . M 

The detailed analysis of the different endocellulases represen—S 
ted by these four clones is described in the followinq. 

• ' ■ " ' .,M 

Example 1 - H| 

Endocellulase 1 (Endol) ^ 

Physica l mapping of endocellulase gene 1 ^ 
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In the cloning experiment described above several cellulase-j 
: positive jS.coli clones were obtained, which contained different^ 
fragments of PstJ partially cleaved Bacillus spp . PL236 DNA.l 
Fig. 2 shows the restriction maps of some of these clones. The|,, 

20 plasmids invariably contained two PstI fragments (1000 and 135cS 
bp.) indicating that both were necessary for synthesis of af 
polypeptide with cellulase activity: Nucleotide sequencing hasl 
later shown that the 500 bp. PstI fragment present the on pPL217/|| 
and pPL517 (Fig. 2) contains the C-terminal part of the completed 

25 cellulase gene (data are presented in the following) . The|| 
structure of this part of the B. spp . chromosome was confirmed by;1 
Southern analysis (data are presented in the following) . 



Activity measurements 



"Af 



Extracts from the E.coli clones containing pPL212, pPL2i6,| 
PPL517 and pBR322 were prepared from overnight cultures grown in|| 
NY medium supplemented with tetracycline. Cells were|| 
concentrated 10-fold in 100 mM phosphate buffer, pH 7.0. DNase; 
35 was added and the cells were ruptured by passing them twi&J$ 
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through a French Press (1200 0 lb/in) . The extracts „ 
centrifuged for 60 nln . at 40000 X a and thJ ******** Were 
assayed f or ce llul ase activity The Lxa-^el T^Tl ^ 

ikiooocppx™,. E .^ MC1000(PPL517)i and M cro 0 : P s 

contained 8 units i ■> ^ uu IP"* 16) 

total cellulase activity .„ these < L s °f 



5 



The high level of, cellulase activity in extracts of • 

thTr-i:: t transcription ° f the — ~ lg £s^» 

* Pilules* Pr0 " 0ter on PBR322. it was concluded that £T 

» cellulase xs expressed in the direction shown in Fig. 2. 

Measurements of the viscosity of a OK- ■=„,„<-• 
Maxicell and ^^an aww1 y B< . 

The molecular weight of the endoalucanase protein was analysed 
by the maxxcell technique. The plasmids pPL212 BPt21(i ! 

^:^rrTx into tte aaxice11 

D and 58000 o In addition to t, P ° lyPeptldeS ° f 75 °°° *• "000 
At^ * ■ addition to the proteins encoded by PBR322 

Apart from the P BR322 proteins no »r-^ • PBR322. 
m-^., _ ^ ins ' no proteins encoded bv the 

Plasmid pPL216 could be detected «- ^ , 

- ii ulase gene . _ th r^:r \z ™^ 

* comparison of the cellulase activity bands and the hands on 
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10 



the autoradiogram from the polyacrylamide gel showed that the 
75000 D and at least one of the 58000 and 65000 D protein bands 
had cellulase activity. It was also found the supernatant of the 
Bacillus see. PL23 6 culture contained at least three different 
proteins with cellulase activity, bne of these proteins emigra- 
ted with the 58000 D protein synthesized in the maxicell. Cell 
extracts from E.coii MC1000 (pPL212) only revealed one active 
band comigrating with the 58000 protein from the maxicell E.coii 
CSR603 (pPL212) . As mentioned previously, the following sequence 
data showed that the plasmids pPL2l2 and pPL216 did not contain ' 
the entire Endol endoglucanase gene. The 75000 D active protein 
seen in the maxicell £.coli CSR603 (pPL212) is thus a fusion ■ 
protein, where 105 C-terminal amino acids are encoded by pBR322 i 
sequences (Fig. 2). This fusion protein is apparently posttrans-1 
lationally processed, ending up with the 58000 D mature endoglu^ 
canase. Cell extracts from E. coii MC1000 (pPL217) and E.coliJIl 
MC1000 (pPL517) , which contain the complete endoglucanase gene'^rf 
also gave activity bands of M r approx 58000 D and 75000 D. TheW 
75000 D active -protein from the piasmids pPL217 and pPL517, thusj' 
0 represents the -genuine" initial translation product from theflll? 
Endol glucanase gene, which apparently is processed more slowlyjT 
than the fusion protein from pPL2i2. However, both the 75000 DlH 
fusion protein synthesized from pPL212 and the "genuine" 7500& 
D protein synthesized from pPL517 and pPL217 are processed down'; 
to a 58 000 D protein with high cellulase activity. 



15 



5 



mi 



"I 



From the sequence data it can also be predicted that th* J 
endoglucanase expressed from pPL216 is synthesized a fusion 1 
protein, where the 2 6 C-terminal amino acids are encoded by*$ 
PBR322 sequences (Fig. 2) . This protein was not detected by the^l 
maxicell technique and zymograms using extract from PL216 only *H 
revealed one active band of 58000 D, which most likely repre^jf 
sents the processed protein. The processing in E. coii of the two' J 
fusion proteins from pPL212 and pPL216, which represent two j§ 
different lengths of the C-terminal "tail" thus results in both^l 
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case, in an active protein of appro* ssooo n ™ • • 

that t he endoceZiuiase is process Z tr J^ZlllT"^ 

. *- oV^ 9000 D - which would eas ^ *~ — 

eQ as a Proenzyme which at least in v ™o ■ 
possibly also in a is moai «-«U (ana 

product, the 58000 D activttv b " Processing 

CPPL212) ejrtr.rt. a =tivity band present in S-S31X MClooo 

PL236. -Pematant o f cultures at BstSillSS sap. 
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20 



25 
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Temperature onti™^ - nd ^ 



The celluiase activitv of 4- 

for , ^ " y of the extracts was measured at rt^ 

i l^zg. 3). The heat stabilitv of t->,« 

™- s a ° s ^1:7^ r rioas and res±auai « 

Wahest activitv tas ? """i" 1 * «* "then* the 

tile of 30 n In ^ tte en ^ " ^ ' 

with a , ; » " r r inaotivate<i at *^-r.*»» 

«*« ,\ oVin^Ticn 0 . C 81,3 ^ -/Nation was observed 



The nucleotide sequence of the endocellulase gene 1 , Endon , 
deduced from the plasmid P PL5 17 wh ,^ «. . (£D ^ i) Was 

pFL5i7 which contains approx. 2850 bp 
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of Bacillus spp . PL236 DNA. 

The sequence wasa determined by the chemical modification method 
(Maxam and Gilbert, 1980) using the partial restriction map and 
5 the sequencing strategy outlined in Fig. 4. 

.The complete nucleotide sequence is shown in Sequence listing 
1D * 1 - A computer analysis of this sequence revealed only one 
open reading frame long enough to encode the approx. 75000 d 
protein detected by the maxicell and zymogram analysis of Of 
extracts from the cellulase-positive E.coli MClbob (pPL5l7) . This '< 
sequence which -begins at nucleotide 677 and ends at nucleotide % 
2776, encodes an enzyme of 700 amino acids. The M r calculated I 
from the DNA sequence was 77006 D. 

Within the °P en reading frame there were three potential initia- I 
tion codons (ATG at positions 677, 737 and 749), but only the S 
ATG codon at position 677 was preceeded by a ribosome binding S 
site (AAGGAGGX (Mclaughlin et al., 1981). it was .therefore f 
0 concluded that the ATG codon at position 677 was the correct °| 
initiation codon. ff : 
- . •- , 

The initiation codon is followed by an amino acid sequence which f 
resembles signal sequences found in gram-positive organisms, f 
5 such sequences consist of a relatively short hydrophilic region 
at the N-terminal followed by a longer sequence of hydrophobic 
residues. 



15 



) 



By using the signal sequence cleavage model proposed by Heijne 
(1983) the cleavage site can be predicted to be between the two 
first alanine residues in the sequence Asn-Ala-Ala-Ala . The 
signal sequence is thus 31 amino acids long. 

The upstream and downstream regions contained no significant 
homology to the consensus sequence of the sigma 43 promoter of 
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;| - £• subtilis and no terminator-like sequences. 



Southern analysis 

5 The Bacillus spp . (PL236) chomosomal DNA was digested with 
fijLndlll, PstI, EcbRI and Xhol and plasmids pPL212 and pPL509 
were used as probes for the hybridization. Plasmid pPL212 
contains two PstI fragments (1350 bp. and 1000 bp.) and plasmid 
PPL509 contains only the 500 bp. PstI fragment of the entire 
10 Endol gene, represented by the plasmid pPL517, - which contains 
three PstI fragments (1350 bp., 1000 bp. and 500 bp.) of 
Bacillus see. PL236 DNA. The pPL212 probe recognized the 
expected two PstI fragments (1350 bp., 1000 bp.) and the pPL509. 
the 500 bp. PstI fragment in the Bacillus spp . PL23 6 PstI 
15 digest. Both the pPL2l2 and the pPL509 probe also recognized the 
same overlapping EcoRI fragment and the same overlapping HindlTT 
fragment in Bacillus spp. PL236, EcoRI and Hindlll digest. These 
results indicate that the Bacillus spp . PL236 DNA insert in 
pPL517 was cloned in a non-deleted form and that the three PstI 
fragments in pPL517 are continuous on the Bacillus spp . PL236 
chromosome. 

Expression of the Endol gene in B. subtilis 

For the cloning experiments in B. subtilis . pPL517 was used as 
the donor of the Endol gene and pDN2801, carrying a strong 
Bacillus promoter P m , was used as the Bacillus vector (Fig. 5). 

The Endol gene-containing Eaal fragment was ligated to Eaal 
cleaved pDN2801 and by subsequent transformation to compentent 
B. subtilis cells (DN1885) , strain CH7 was obtained. To test 
whether the processed C-terminal part was necessary for the 
expression of the Endol gene in B. subtilis cells, a construc- 
tion was made where the Endol gene was fused to vector sequen- 
ces in the internal Bglii site. This fusion replaces the coding 
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region for the C-terminal 94 amino acids with 55 "random" amino 
acids encoded by vector sequences. 

Similar constructions made in E.coli vectors, though fused to 
different vector sequences, resulted in an active periplasmic 
endoglucanase in E.coli, which was processed in the "correct" 
manner. Part of the Endol gene contained in the Ball! fragment 
from pPL517 was ligated with BamHI cleaved pDN2 8 01 and sub- 
sequent transformation to competent B. subtilis cells (DN1885) 
resulted in strain CH14 . 

- Trans formants ..were in both cases screened for their chloramphe- 
nicol resistance, and the desired plasmid constructions in the 
strains CH7 and CH14 by restriction analysis of their plasmids J 
The two versions of the Endol gene on the plasmids pCH7 and! 
pCH14 are thus transcribed from the same promoter P . aI* 
.restriction map of the plasmids is shown in Fig. 6. 'xM 

• • '- ^1, 

: : The B. subtil Is DN1885 used for these experiments produces ;an| 

endoglucanase of its own, which of course gave some background 

activity. The Endol gene product was exported to the culture 

medium from the recombinant strain CH7, and the activitf; 

measured in the culture supernatant was approximately 20 timesf 

higher than the background activity (Fig. 6) . No extracellular 

activity above the background level was detected from 

recombinant strain CH14 which contains the Endol gene with 

substituted C-terminal. 

The culture supernatant s from the strains CH7 and CH14 and c 
extract from CH7 cells were analysed by the zymogram technique-^ 
The zymogram revealed active protein bands of approx. 7500C&J 
and 58000 D from the CH7 cell extract and only one actiV^ 
protein band of approx. 58000 D from the CH7 culture superna- 
tant. These bands correspond to those observed in E. coli and 
processing of approx. 90 amino acids from the C-terminal appe 
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to take place in B. subtilis too. 

The plasmid pCH7 was transformed to PL1801, which is a der- 
.. ivative of DN1885 lacking the two main exoproteases (apr",npr" 
5 ) , resulting in the strain CH14 . The Endol cellulase as produced 
- from CH14 was processed "normally" indicating that the two main 

exoproteases from B. subtilis are not responsible for the c- 

terminal processing of the Endol cellulase. 

10 A very weak active band of approx. 58000 D was detected from the 
CH14 culture supernatant, indicating that the manipulated gene 
is expressed and processed in at least almost the same way as 
the native gene product. Among other things, the very low 
expression from pCH14 and the fact that the two genes are 

15 expressed from the same expression signals may indicate that the 
approx. 90 C-terminal amino acids are necessary for the export 
of the Endol gene product from B. subtilis . 

Optimization of expres sion of the Endol gene in E. col i 

20 

In order to optimize the expression of Endol . the Endol gene was 
combined with the strong E. coli promoters P R and P L originating 
from phage lambda (Remaut et al., 1981). Both promoters are 
repressible by the lambda cI857 repressor, which is heat labile, 
25 thus rendering the P R and P L promoters heat inducible, in cells 
producing the lambda CI857 gene product. (Ptashne et al. , 1982) . 

The P R promoter is contained on the expression plasmid pPL170 
together with the lambda CI857 gene. (Fig. 7; Jergensen, 1983). 

0 

The P R promoter was placed upstream of the Endol gene by ligating 
the p R containing Pvul - Sail fragment from pPL170, to the Endol 
gene containing Pvul - Sail fragment. Transformation to 
competent MC1000 cells resulted in the strain TL05 containing 
5 the plasmid pTL05. In the plasmid pTL05 the /3-lactamase promoter 
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is deleted, thus bringing the Endol gene under transcriptional 
control of the P R promoter. At this point, the Endol gene was 
believed to be contained within the Balll fragment from pPL2129. 
The Endol gene fusion to vector sequences on pPL212 was 

..therefore transferred to pTL05, resulting in a fusion protein 
where 105 C-terminal amino acids are encoded by vector 

. sequences. This fusion protein is however processed correctly as 
shown earlier with the strain PL212. The cellulase production 
from TL05 is completely repressed at 28 *C arid induced at 42 *c. 

The P L promoter provided on the plasmid pPLc28 i(Remaut et-al., 
1981) was combined with the Endol gene by ligating the Ball I 
fragment from pPL2l2 to BamHI cleaved pPLc28. Transforming to. 
competent PL248 cells, which are harbouring the lambda cI857 
gene on a compatible pACYC177 based plasmid (pNF2690) , resulted 
in the strain LA03, containing the plasmid pIA03 (Fig. 7). The 
Endol gene is thus fused to vector sequences, but due to 
unspecified DNA sequences in pPLc28 the length and nature of the 
resulting fusion protein is unknown. The cellulase production 
from LA03 was completely repressed at 28 *C and induced at 
40.5-c. 



The cellulase production from IA03 and TL05 was evaluated at 
different temperatures. 

IA03, -TL05 and PL212 were grown overnight at 28 'C in NY medium 
supplemented with the appropriate antibiotics (AMP + KAN, KAN 
and TET, respectively) . For each strain the overnight cultures 
were diluted 100 fold in NY medium (AMP + KAN, KAN and TET, 
respectively) , and the diluted cultures were grown at different 
temperatures between 2 8'C and 42' C. Cells from each culture were 
harvested at OD 450 = l and lysed on a French Press, and the 
activity in the extracts was determined as described earlier. 

35 LA03 which exhibited the highest cellulase production was unable 



REPLACEMENTSHEET 



WO 91/10732 



PCT/DK9I/00013 



37 

to grow at temperatures above 4 0.5°C. The experiment was 
repeated without antibiotic selection pressure in the diluted 
cultures. Similar results were obtained, but LA03 grew very 
slowly at temperatures above 4 0.5°C. ; However, this growth was 
followd by a significant loss of the plasmid pIA03. No signifi- 
cant loss of pIA03 at temperatures up to 4 0.5 *C or pPL212 and 
pTLOS at any temperature, was observed. 

Optimization of expression of the Endol gene in B . subtil is 

In order to optimize the expression of the Endol gene in 
B. subtilis . the Endol gene was fused to the expression-signals 
(promoter, ribosome binding site and signal sequence) from the 
alpha-amylase gene from B. lichenif ormis , which is expressed in 
high amounts in B. subtilis . 

pPI.1759 contains the promoter, ribosome binding site and most of 
the signal peptide of the B. lichenif ormis alpha-amylase 
(Stephens et ai., 1984) . The downstream side of this region ends 
with a PstI site, which again is followed by a polylinker (Fig. 
8). Between the PstI and the Sail site in the polylinker of 
PPL1759, a synthetic DNA fragment consisting of two comple- 
menting oligonucleotides creating PstI and Sail "sticky" ends 
was inserted. In the resulting plasmid pCH52 the synthetic 
linker reconstitutes the missing part of the signal peptide of 
the alpha-amylase and further encodes the first 14 N-terminal 
amino acids of the mature Endol cellulase (Fig. 8) . The linker 
thus creates a hybrid signal peptide cleavage site between the 
alpha-amylase and the cellulase. The expected cleavage site is 
shown in Fig. 8. From pPL»517 the Endol gene was excised without 
promoter on an Eaal fragment and inserted into the unique Eaal 
site in pCH52. The plasmid in which the Endol gene was inserted 
in the correct orientation was named pCH54 . 



PCH54 contains two direct repeated sequences of 45 bp (e.g. the 
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45 N-terminal base pairs of the mature Endol gene) which may 
recombine, deleting the region between them (Ehrlich et al., 
1986). This recombination event, however, occurs with a very low 
frequency when the repeat is as small as 45 bp. In order to 
enrich the amount of plasmid that has recombined, a plasmid 
preparation of pCH54 was cut with the enrichment restriction 
enzyme Konl . pCH54 contains a unique Konl site between the two 
direct repeats and only non-recombinant plasmids are cut with 
Kpnl, while recombinant plasmids stay circular. When B. subtilis 
(DN 1885) was retransf ormed with this mixture, transf ormants 
were mostly (90%) containing recombinant: plasmids, since 
B. subtilis competent cells are not transformed with linerized 
plasmid DNA. The recombinant plasmid was called pCH57 and is 
contained in the strain CH57. The structure was confirmed by 
restriction analysis, but the gene fusion was not confirmed by 
DNA sequencing. In this construction pCH57 the Endol gene is 
thus perfectly fused to the alpha-amylase expression signals. 

The endoglucanase is produced extracellularly from the B. 
subtil is strain CH57, indicating that the hybrid signal cleavage 
site is functioning. The secreted Endol endoglucanase is 
processed to the expected M r , namely 58000 D. 

The production of the Endol cellulase from CH57 was evaluated in 
two different media, NY (overnight at 37 *C) and BPX (7 days at 
37 •C)". The BPX medium is a very rich medium in which the 
nutrients are slowly released, thus keeping the cells in an 
early stationary fase for several days during fermentation. The 
alpha-amylase expression signals function particularly well in 
this medium. The results appear from Table 1 below. 
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Table 1 



STRAIN 


PLASMID 


U/ml 


U/ml 






NY 


BPX 


CH7 


pCH7 


6.5 


55.0 


CH57 


pCH57 


13.0 


• 325. 0 


DN1885 




0.3 


40.0 



Analysis of the culture supernatant (BPX-medium) on PAGE 
revealed a dominant (90%) endoglucanase band corresponding to a 
concentration of endoglucanase in the supernatant of approx. 0.5 
g/L. 



Example 2 

Endocellulase 2 f Endo 2 ) 
DNA-seouence 

The nucleotide sequence of endocellulase 2 (£ndo2) was deduced 
from the plasmid pPL3 82 which is described above. The plasmid 
contains approx. 2500 bp. of Bacillus spp . FIi236 DNA. The 
sequence was determined by the chemical modification method 
(Maxam and Gilbert, 1980) using the partial restriction map and 
the sequencing strategy outlined in Fig. 9. 

The complete nucleotide sequence is shown in Sequence Listing 
ID#3. A computer analysis of this sequence revealed only one 
open reading frame long enough to encode for the approx. 5 6000 
D protein detected in the zymogram analysis of the extract from 
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the cellulase positive clone E.coli MC1000 (pPL382) . i>hi 
sequence begins at position 172 and ends at 1869 and encodes an 
enzyme consisting of 566 amino acids. The calculated M r is 62551 
D which is slightly higher than the M r of 56000 D determined by 
zymogram analysis. This difference could be due to inaccuracy i n \. 
the zymogram analysis or to post-translational processing beyond 
the expected processing of the signal peptide. The atg 
initiation codon in position 172 was selected because it was the 
only initiation codon within the open reading frame, which was 
proceeded by a ribosome binding site AAGGAGG (Mclaughlin et al.~s 
1981). ' - '.' 



10 



This initiation codon was followed by a signal seguence-like\V 
sequence, and by use of the signal sequence cleavage model' |f 
proposed by Heijne (1983), the cleavage site could be predicted 8 ftll5 
to be between the two alanine residues in the middle of the^ 
sequence Leu-Ala-Ala-Ala. The signal sequence of the Endol2 is jflj 
thus 30 amino acids long. 



mi 



'5 : 



20 The region upstream of the open reading frame contained - a^BlJO 
sequence homologous with the sigma 43 type promoters of B.subt^^Jf? 
ills (Johnson et al., 1983) at position 46-75. e '"~ 



This sequence consists of TTTACA as the -35 region and TATTAT as-^j3 
5 the -10 region; the two are separated by 18 nucleotides. : ' : ^|§|K 

A palindromic repeat sequence of 13 bp. was found downstream of" 
the termination codon at position 1956-1981, which seems to^ 
resemble a rho-independent terminator (Rosenberg and court » n |Jl 
0 1979). , 8$-^ 

Southern analysis 

The Bacillus S pp . (PL236) chromosomal DNA was digested with || 
> Sindlll, Pstl. EcoRI and Xhol and the plasmid pPL382 was used as 
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a probe for the hybridization. The hybridization pattern 
obtaxned confirmed that the Bacillus m . PL236 DNA was cloned 
xn non-deleted fota, that the two Hindlll fragments from P P L3 82 
was continuous on the Bacillus sppj. PL236 chromosome and that 
the Endo2 gene was different from the other cloned endoglucana- 
ses . 



Expression of Bndo2 in B.suhj-il 



xs 
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Plasmid P PL382 was transformed to B.subtilis DN18S5 to achieve 
secretxon of the .mature Endo2 product, b. subtilis DN1885 
(PPL382) was grown aerobically in 640 ml LB-medium containing 10 
/*g/»l tetracycline for 30 hours. The supernatant was 
concentrated by precipitation for 24 hours with (NH^so, at 70* 
saturation. After 5 hours of dialysis against 100 mM Tris-HCl P H 
7, the concentrated supernatant was heated to 55'C for 15 
mxnutes. Denatured protein was removed by centrxfugation and the 
soluble proteins were subsequently precipitated with (NHJ.so, at 
•70% saturation. The resolubilized proteins were dialyzed against 
100 mM Tris-HCl pH 7 with a final volume of 2 ml and were 
apPlxed to a 80 cm x 1 cm gel filtration column containing 
Ultrogel AcA 44 (lkb) . Active fractions were pooled 
concentrated and applied to a SDS containing polyacrylamide gel,' 
where the endoglucanase appeared as a single band at 56-kDa. The 
activity yield of the method was approximately 10%. 

The endoglucanase comigrates with the endoglucanase obtained 
from extracts of MC1000 (pP L3 82, as detected by Zymogram 
analysxs xn the supernatant of DN1885 ( P PL382) . The endogluca- 
nase activity of DN1885 (pPL382) is about 25 times that of 
straxn DN1885 when grown in NY medium. 
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EXAMPLE 3 
Endocellulase 3 

5 Physica l mapping of the endocellulase clone 3 i 

The endocellulase clone 3 is represented by the plasmid pPL59i 
which contains a 11000 bp. EcoRI fragment of Bacillus spp . PL236 
DNA. A partial restriction map of this plasmid is shown in Fig. 
10 10. Zymogram analysis of extracts from E. coli MC1000 (pPL59i) ci 10 
shows that the insert gives rise to four proteins with cellulase 
activity. The approx. M r of these proteins were 60000 D, 56000 
D, 45000 D and 30000 D (Fig. 10). 



15 Southern analysis showed that the EcoRI fragment from pPL591 was c£ 
cloned in a non-deleted form from the Bacillus spp . PL236 - 
chromosome, and indicates that the DNA did not contain the DNA- % 
sequences encoding the Endol and Endo2 genes. 

2 0 -To analyse whether these proteins represented post-trans- °| 
lational processing products from one or several cellulase '% 
genes, deletion plasxnids vere made using the restriction enzymes 
pindlll and Smal. Deletion of the 4800 bp. Smal fragment, 
resulting in the plasmid pPL538, did not eliminate any of the 

25 four cellulase bands on the zymogram. The B. sdp . DNA insert on 2i 
PPL591 contained 5 Hindlll sites. Digestion of pPL59l with 
ffjndlll thus gave 6 fragments of 6600 bp. (vector fragment), 
3200 bp., 1700 bp., 1550 bp., 1350 bp, and 900 bp., where the 
1700 bp. fragment had originated from the Hindlll site within 

30 the pUN121 vector plasmid (Fig. 11) . 

Elimination of all of the Hindlll fragments except the 1700 bp. 
fragment (reinserted in the opposite direction) resulted in the 
plasmid pPL540. Removal of the rest of the Bacillus spp . PL236 
35 DNA by eliminating the EcoRT fragment from pPL540 resulted in 
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the plasmid pPL587. Both the E. coli MC1000 (pPL540) and the 
£.coli MC1000(pPL587) were cellulase-positive and zymogram 
analysis of extracts from these clones revealed only the 45000 
D and 3 0000 D proteins. The protein-coding capacity of the 
Bacillus spe. PL236 DNA (approx. 1500 bp.) is approx. 55000 D, 
which is too small to contain two endoclucanase genes of 45000 
D and 30000 D. The 30000 D protein on the zymogram is thus most 
likely a result of immature post-translational processing of the 
45000 D protein. The cellulase gene encoding the 45000 D protein 
was designated Endp3A. The Endo3A gene was cloned in both direc- 
tions on the plasmids pPL587 and pPL538 giving rise to the same 
two proteins, thus eliminating the chance of the protein being 
a fusion protein. 

Elimination of the 1550 bp., the 1350 bp. and the 900 bp. 
fragment resulted in the plasmid pPL542. Extracts from E. coli 
MCl000(pPL542) revealed cellulase positive proteins of approx. 
30000 D, 45000 D, 49000 D and 56000 D (Fig. 10). Prom these 
preliminary results, the existence of two additional endogluca- 
nase genes within the original insert on pPL591 and pPL53 8 are 
postulated. The additional endoglucanase genes are designated 
Endo3B and Endo3C. Their postulated position on the Bacillus 
see. PL23 6 DNA is shown in Fig. 11. The postulated model is 
based on the assumption that the 60000 D protein made from 
PPL538 and pPL591 is converted to a truncated fusion protein of 
49000 D made from pPL542 where the HindlTT fragment of 1350 bp. 
is deleted. 

DNA-seo uence of Endo3A 

The DNA sequence of the Endo3A gene was deduced from the plasmid 
PPL54 0 containing approx. 1500 bp. of Bacillus spp . PL236 DNA 
using the dideoxy chain termination method. The gene was placed 
in pUC18 in both orientations, and a number of deletions were 
constructed. Standard primers were used except for one synthe- 
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tic oligonucleotide that was used for sequencing a regio n 
no practical restriction sites. 



■m 



The C-terminal part of the gene was deduced from the pl as J 
PPL538. The partial .DNA-sequence 1 is shown in Sequence Listf 
ID#6. The sequence revealed an open reading frame coding f 0 *J| 
protein with a M r of about 62000 D which is in agreement with t|f 
observed protein of 60kD in the zymograms. The ATG start cod; 
(position 30) is preceded by a typical ribosome binding s £ 
(Mclaughlin et al. , 1981). The initiation codorv is followed .„ 
a typical gram-positive signal sequence and by using the sighlf 
sequence cleavage model (Heijne, 1983) a signal sequence of' 
amino acids is revealed. s 



15 EXAMPLE 4 



Endocellulase A 



Zymogram analysis 



:0£ f 



The endocellulase clone No. 4 is represented by the plasnlf 
pPL592, which contains approx. 14000 bp. of Bacillus spp . PL^f! 
DNA. A partial restriction map is shown in Fig. 1. 

A zymogram analysis of extracts from E. c'oli MC1000 (pPL592« 
revealed three cellulase active proteins with M values offl 
approx. 92000 D, 74000 D and 71000 D. Further analysis - is |j| 
necessary to determine whether these proteins are encoded by one|ff 
or several cellulase genes. 



" for 

JI?The 

II?*" 

Att: 



I*h< 




m 



Southern analysis confirmed the origin of the cloned DNA on the >|| 
Bacillus see. PL236 chromosome, and indicates that the cloned 
DNA is not represented on the other endoglucanase clones. 



35 
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EXAMPLE 5 

A. Endol cloned and expressed in Bacillus subtilis 

An agar slant was inoculated with B^. subtilis strain CH 57 and 
incubated for 20 hours at 37'C. 10 ml of a 0.9 % NaCl-solution 
was added to the test tube which was shaken to suspend the 
cells. The cell suspension was used to inoculate a 2 1 
fermentor. 

The following parameters were used to run the fermentation: 



Temperature: 37 'C. 
Aeration: 1,1 1/minute. 
Stirring: 1100 rpm. 

Fermentor: A 2 1 model with a working volume of 1,5 1. 

The pH was maintained between 6.2 and 7.2 for the first 40 hours 
of fermentation. After that the pH was maintained between 6.7 
and 7.2. The pH was maintained within this range by dosing with 
NH 3 and H 3 P0 4 . 

Dosing of a glucose solution was initiated after 40 hours at a 
flow rate of 3.7 ml/hour. 
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Substrate 

Potato starch degraded with Termamyl* 
Soybean meal 
5 Corn steep Liquor ' 
Alburex (potato protein) 
(NHJ 2 S04 
KH 2 P0< 

Na 2 HPO A . 2H 2 0 
10 Water added up to 1100 ml. 



20 



25 



30 



50 g 
110 
16.5 
27.5 
2.2 
1,2 
5,9 



*Termamyl is a commercial L licheniformi R a-anylase avalabl, 
from Novo Nordisk A/S . 



15 Glucose solution 

600 ' g 



Glucose. H z O 



Citric acid 

0, 6 g 

Water added up to 1000 ml 

, The fermentation was stopped after 166- hours of fermentation at 
an . OD 65o value of . 122. At that time there were 1200 ml of 
fermentation broth in the fermentor. 

The fermentation broth was centrifuged, and the extracellular 
volume was 400 ml containing 40 CMC-endoase units per ml (16.000 
CMC-endoase units in all). The culture medium was further 
processed by filtration and dilution followed by concentration 
on an Amicon ultrafiltration module with a cut-off at 10.000 mm. 
The concentrated enzyme solution was frozen. 



Half of the frozen liquid was thawed and diluted with deionized 
water and then concentrated once more on an Amicon 
ultrafiltration module. The total yield was 4471 CMC-endoase 
5 units (from 8000 CMC-endoase units) . 
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The total volume of 1050 ml was subjected to ion exchange 
chromatography at pH 7 . The enzyme was bound to a DEAE- 
Sephacryl anion exchange column (3 00 ml volume) at pH 7 (50 mM 
tris-HCl) . The Endol enzyme was eluted at pH 7 with 0.3 M NaCl. 

The purified enzyme has a molecular weight of 58 , 000 D on SDS- 
PAGE. The pi is 4.0. Its activity is 3 0 CMC-endoase units per mg 
protein. 

The protein determination is based on the amino acid composition 
of the enzyme deduced from the DNA sequence: 13 tryptophan, 3 0 
tyrosine and a molecular weight of 57,566 D. The extinction, 
coefficient is calculated by means of the following formula: 

( 13 x 5559 + 30 X 1197 ) / 57566 = 1.88 

The purified enzyme has an endoglucanase activity of 57 CMC- 
endoase units- per ml and an absorbance at 280nm of 3.6. 
Thus, (57 x 1.88)/3.6 = 30 CMC-endoase units per mg protein. 

B. Stability of Endol in detergents 

The following 4 detergent compositions were used: 

1. USA liquid detergent: 2 gram per liter of 6* hardness water 
( 1 part tap water to 2 parts deionized water) . The pH was 
measured to 7,29, 

2. USA Heavy Duty Powder detergent: 0,9 gram per liter of 6° 
hardness water. The pH was measured to 9,2. 

3. Heavy Duty Powder detergent (2) with bleach and activator: 
0.12 gram/1 Na-perborate tetrahydrate and 0,088 gram/1 NOBS . The 
pH was measured to 9,2. 



WO 91/10732 



PCT/DKPl/aoo^ 



48 



4 . European Heavy Duty Powder detergent with bleach "and 
activator (Batch DR 8806 Europe). 5 gram per liter in" 9. 
hardness water. 



Cellu2yme™ (batch CAX 007 crude enzyme with cellulase and other 
enzymes) 2353 CMC-endoase units per gram was compared with £ndb^ 
3 0000 CMC-endoase units per gram. - v;i 

The enzymes were diluted to 3 CMC-endoase units per ml in all ^ 
detergent solutions: The endoglucanase activity after dilution? 
was measured as described above (by determining the decreaseliS 
the viscosity of CMC) . The endoglucanase activity after *60 : 'raijn^ 



incubation at 4 0 # C was measured and compared with the ini'tlalf 



activity. The following results were obtained: 
Detergent solution 



Celluzyme™ 
Endol 



12 3 4 



89% 75% 66% 75% 



106% 90% 97% 98% 



The standard deviation is 10%. 



2dt£ 



It appears from the table that Endol is more stable at a pHj 
9-10 compared with Celluzyme™ in these detergents. 

EXAMPLE 6 



w 



A. Prep aration of full-lenath r-75 kD) Endol in R. sTibtiliS ^fe 



10 1 of LB medium containing 1 mM CuCl 2 and 10 
chloramphenicol was inoculated with 10 ml of an ovefni9|| 
culture of JL. subtil is DN969 (B. Diderichsen et a*'* 
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; J. Bacterid. 172(8), 19 90/ pp. 4315-4321) containing the plasmid 
pCH7 (described above in example 1) , divided among 10 sterile 2 

. 1 flasks and incubated with vigorous shaking for 36 hours at 
37 'C. The culture was centrifuged for 10 minutes at lOOOOxg and 

. 4*C after which EDTA, pH 8, was added to the supernatant to a 
final concentration of 5 mM. 

25 g of Avicel PH-105 which had been hydrated in ethanol and 
washed with destilled water was added to the supernatant which 
was left standing with gentle stirring for 2 hours at 4'C. The 
supernatant/Avicel mixture was centrifuged for 10 minutes at 
lOOOOxg and 4'C. The cleared supernatant was decanted off 
immediately after the rotor had stopped. 

The Avicel/ enzyme cake was resuspended and washed in 200 ml of 
lmM EDTA, and the mixture was centrifuged for 1 minute at 
lOOOOxg. This procedure was repeated twice. The Avicel/ enzyme 
cake was then resuspended in 150 ml (1% triethylamine and 1 mM 
EDTA) and was left standing with vigorous stirring for l hour at 
4-C. The mixture was centrifuged for 1 minute at lOOOOxg and 
4-C. The supernatant was retained. This procedure was repeated 
twice. 

The solution of enzyme and triethylamine (about 3 00 ml) was was 
evaporated in vacuo to 100 ml. The temperature of the solution 
was not allowed to exceed 10 'C. The pH was adjusted to 7 by 
adding 1M HCl, and the solution was frozen at -70'C. 

The amount of protein in the 100 ml enzyme solution was 
determined to be 40 mg by means of a Bradford reagent (available 
from BioRad) using bovine serum albumin as the standard. 

B. Characteris ation of the -75- kD Endol 

The enzyme obtained above had a purity of about 90%. The enzyme 
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was found to have a molecular weight of 75 kD on SDS-PAGE r 

In immunoprecipitation experiments (carried out by J '- 
Immunoelectrophoresis in agarose gel as described by n. Axel " 

et al., Chapter 2 .in A Manual r> ? o„ an f^^°^ 

Immunoelectrophorpsi « , Blackwell Scientific Publ. 1973), ~75~jJ^ 
Endo1 was immunoreactive with a monospecific rabbit antibody 
raised against the core region (-58 kD form) of Endol . *% 

The -75 kD and -58 kD (processed) forms of Endol were tested fdr?|§ 3 
their ability to bind to cellulose (Avicel) • lO /ig of each'foffll 
V -75 kD Endol and -58 kD Endol were added to 50 >1 of a cellulosefil 
suspension (0.1% w/w Avicel, 5 mM EDTA, pH 8.0). The suspensioKsifl 
were shaken for 3 0 minutes, and the Avicel was harvested:. atSl§ 
15 10000- rpm for 2 minutes. The amount of Endol cellulase remaihiS^ 
in the supernatant was analysed by SDS-PAGE. More than 95^o|tpl 
. the -75 kD Endol was bound to the cellulose, while less th«Pl||| 
, of the -58 kD Endol was similarly bound. This shows that thef^f 
.. ter minal part of the -75 kD Endol cellulase comprises 3 ?! 
20 cellulose-binding domain. ; .3? 

h-. •• .• . . • ... .0*: 

Extensive amino acid sequence homology was found between" thfl§[ 
region (from amino acid 554 to 700) of the Endol cellulase an|| 
other cellulases, e.g. an endocellulase from Bacillus subtilfll 
25 (Nakamura et al., 1987), the middle part of the bi functionSff 
cellulase from Caldocellmii saceharolvt j esam (D.J. Saul et air|j| 
Nucl.Acids Rf>s. 17, 1988, p. 439), two endocellulases *roaj 
Clostridium stercorarium (W. Schwarz et al., Biotech . Lett. H»3t 
1989, pp. 461-466). c-rCjf* 

30 

C. Colour clarifi cation effect of -75 kD Endol : **$5f 

The colour clarification effect of -75 kD Endol was determined^ 
by exposing a prewashed worn textile surface to the enzyme anjj 
then measuring the clarity of the surface colour compared to tfc«J 



clz 
tre 

Bla 

tUIT: 
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clarity of the surface colour of textiles which had not been 
treated with the enzyme. 

Black 100% cotton swatches (15 x ;10 cm) were prewashed and 
tumble-dried under the following conditions 



Detergent: 



Temperature : 
Washing time: 
Drying time: 
No . prewashing/ 
drying treatments : 15 



Keminus (available from Irma A/S) , 
1.5g/l 
70*C 

60 minutes 
30 minutes 



The swatches were prewashed in a conventional washing machine 
(Mxele Deluxe Electronic W761) . After each wash, the swatches 
were dried in a tumble-drier. The visual effect of the 
prewashing/drying was that the surface colour turned greyish due 
to the presence of damaged cellulose fibres causing the worn 
look. 



After prewashing, the swatches were washed in a Terg-o-Tometer 
(toploaded mini washing machine) under the following conditions 



Liquid volume: 
Agitation: 
Detergent : 
Washing time: 
Washing tempera- 
ture: 

No. of swatches: 
-75 kD Endol 
dosage: 
pH: 



800 ml . 

100 movements/minute 
Standard detergent, 5g/l 
3 0 minutes 

40*C 
2 

0 and 60 CMC endoase units/1 
7.0 



No. of treatments: 3 
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Standard detergent: 

LAS NANSA 1169/P: 10% 

AE Berol 160: 15% ' - 

Ethanol, 96%: io% 
TEA: 5% 
Water: 60% 

After each wash, the swatches were rinsed in tap water and dried « 
at room temperature. 

The surface colour of the swatches was analysed by measuring 
reflected light. White light was projected onto the surface, and 
the reflection/remission was measured at 16 wavelengths (4 00 nm *' L 
- 700 nm) . The results from the measurements were processed (by I 
means of an "Elrepho 2000" apparatus available from Datacolor, J 
Switzerland) into Hunter coordinates of which the L-coordinate 4 
represents the. grey scale values. Each swatch was analysed twice % 
on each side, and the results shown below are are a - total M 
average from the measurements of the two swatches from the same ;I 
treatment. In the table, white is L= loo, and black is L = o. % 



Dosage 0 CMC endoase/160 CMC endoase/1 

L 16.78 15.50 

- S.D. 0.08 0.03 

Delta L 1.28 

Comparable results were obtained with Celluzyme™ (batch PPC 2174 

containing a mixture of enzymes from Humicola insolens r DSM 
1800) 

Dosage 

(CMC endoas /l) o 15 30 60 

Delta L - 1.02 1.48 1.90 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Novo Nordisk A/S 

(ii) TITLE OF INVENTION: An Enzyme Exhibiting Cellulase Activity 

10 (iii) NUMBER OF SEQUENCES: 7 

(iv) CORRESPONDENCE ADDRESS: 

• (A) ADDRESSEE: Novo Nordisk A/S, Patent Department 
(B) STREET: Novo Alle 
15 (C) CITY: Bagsvaerd 

(E) COUNTRY: DENMARK 

(F) ZIP: DK- 2880 

(v) COMPUTER READABLE FORM: 
2 0 (A) MEDIUM TYPE: 

(B) COMPUTER: 

(C) OPERATING SYSTEM: S-ss- 

(D) SOFTWARE: 

25 (vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: . .... 

(B) FILING DATE: 1" 

(C) CLASSIFICATION: 



30 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: DK 164/90 

(B) FILING DATE: 19-JAN-1990 . >;a * 

(viii) ATTORNEY/AGENT INFORMATION: 
35 (A) NAME: Thalse-Madsen, Kine Birgit 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 3425.204-W0 

(ix) TELECOMMUNICATION INFORMATION: 
40 (A) TELEPHONE: +45 4444 8888 

(B) TELEFAX: +45 4449 3255 

(C) TELEX: 37304 ... 

45 (2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2977 base pairs 

(B) TYPE: nucleic acid • 
50 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



IS 
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(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCI MB 40250 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 677.. 2776 
(D) OTHER INFORMATION: 



56 

W 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

ATCTTGAAGC TTCGAGGTTC GCGTAACCAG GTCGATCGGC TGACTTTCCT CGCCAAGCTG 60 

CACCATGGCT TCATAGATTA ATTGGTGAGG AGGATCATAG AAATCTTCCG TCCGCACCGC 120 

TCCATGGCGG TAATCAGCGC TTCCGACTGC AGTACAGAGC GGCCGATACA AATGCAGCCG 180 

ACAACCAGAT CAAGCCGTCC TTCAACATCA AAAACAACGG TACTTCGGCT GTTGATTTAA 240 

GCACGCTCAA AATCCGCTAC TACTTCACCA AGGATGGTTC TGCGGCGGTG AACGGCTGGA 300 

TCGACTGGGC GCAGCTCGGC GGCAGCAACA TTCAGATCTC GTTTGGCAAC CATACTGGCA 360 

CGAATTCGGA TACGTACGTG GAGCTGAGCT TCTCGTCCGA GGCAGGCTCG ATTGCGGCGG 420 

GCGGCCAATC CGGTGAAATC CAGCTGCGCA TGTCCAAGAC GGACTGGTCG AACTTTAACG 480 

AGGCGAACGA CTACTCGTTC GATGGGACGA AGACGGCCTT TGCTGACTGG GATCGGGTCG 540 

TATTGTACCA GAACGGCCAA ATAGTGTGGG GAACTGCTCC ATAAACCGAT ACAGGGGAAT 600 

GTGCCGGAAC CGCTCTTTTG CAGGGCAGAC TGGCGGTATC CCTTGCTGAA ATGACTATTC 660 

CTGGGAGGGA TCAAAA ATG AAG AC A AGA CAA AGA AAG CGG CTG TTC GTC 709 

Met Lys Thr Arg Gin Arg Lys Arg Leu Phe Val 
1 5 10 

AGT GCG GCG CTG GCA GTA TCC TTG ACA ATG ACC GTA CCG ATG CCC GCT 757 
Ser Ala Ala Leu Ala Val Ser Leu Thr Met Thr Val Pro Met Pro Ala 
15 20 25 

TCT GTA AAT GCA GCT GCG AGT GAT GTC ACT TTC ACG ATT AAT ACG CAG 805 
Ser Val Asn Ala Ala Ala Ser Asp Val Thr Phe Thr lie Asn Thr Gin 
30 35 40 

TCG GAA CGT GCA GCG ATC AGC CCC AAT ATT TAC GGA ACC AAT CAG GAT 853 
Ser Glu Arg Ala Ala lie Ser Pro Asn He Tyr Gly Thr Asn Gin Asp 
45 50 55 
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60 



15 



20 



35 



40 



CTG 
Leu 
60 

CTG 
Leu 



AGC GGG ACG GAG 
Ser Gly Thr Glu 



ACG GGT TAC AAC 
Thr Gly Tyr Asn 
80 

TGG CTT CAT TAC AGC 
10 Trp Leu His Tyr Ser 

95 

GAC ACC GAC TGC GAC 
Asp Thr Asp Cys Asp 
110 

AAA TCT TTG GAG AAT 
Lys Ser Leu Glu Asn 
125 

TAT GTG TCC CGG 
Tyr Val Ser Arg 



GGT 
Gly 
140 

25 GCT 
Ala 



CCG TCA CCG CGT 
Pro Ser Pro Arg 
160 

TTC TCC CTT CAG CCT 
30 Phe Ser Leu Gin Pro 
175 

GAA GTT AAC TTC CTG 
Glu Val Asn Phe Leu 
190 

ATC .AAA GCG TAT 
He Lys Ala Tyr 
205 

CAT CCA AGG ATT 
His Pro Arg He 



GGC 
Gly 



ACG 
Thr 
220 

45 GCT 
Ala 



GCC 
50 Ala 



AAG AGC ATC GAC 
Lys Ser He Asp 
240 

GAA ATA TTC GGT 
Glu He Phe Gly 
255 



AAC TGG TCA TCC CGC AGG CTC GGA GGC AAC CGG 
Asn Trp Ser Ser Arg Arg Leu Gly Gly Asn Arg 
65 .70 75 

TGG GAG AAC AAC GCA TCC AGC GCC GGA AGG GAC 
Trp Glu Asn Asn Ala Ser Ser Ala Gly Arg Asp 
85 90 

GAT GAT TTT CTC TGC GGC AAC GGT GGT GTT CCA 
Asp Asp Phe Leu Cys Gly Asn Gly Gly Val Pro 
100 105 

AAG CCG GGG GCG GTT GTT ACC GCT TTT CAC GAT 
Lys Pro Gly Ala Val Val Thr Ala Phe His Asp 
115 120 

GGA GCT TAC TCC ATT GTA ACG CTG CAA ATG GCG 
Gly Ala Tyr Ser He Val Thr Leu Gin Met Ala 
130 135 

GAT AAG AAC GGT CCA GTT GAC GAG AGT GAG ACG 
Asp Lys Asn Gly Pro Val Asp Glu Ser Glu Thr 
145 150 155 

TGG GAT AAG GTC GAG TTT GCC AAA AAT GCG CCG 
Trp Asp Lys Val Glu Phe Ala Lys Asn Ala Pro 
165 170 

GAT CTG AAC GAC GGA CAA GTG TAT ATG GAT GAA 
Asp Leu Asn Asp Gly Gin Val Tyr Met Asp Glu 
180 185 

GTC AAC CGG TAT GGA AAC GCT TCA ACG TCA ACG 
Val Asn Arg Tyr Gly Asn Ala Ser Thr Ser Thr 
195 200 

TCG CTG GAT AAC GAG CCG GCG CTG TGG TCT GAG 
Ser Leu Asp Asn Glu Pro Ala Leu Trp Ser Glu 
210 215 

CAT CCG GAG CAG TTA CAA GCG GCA GAA CTC GTC 
His Pro Glu Gin Leu Gin Ala Ala Glu Leu Val 
225 230 235 

TTG TCA AAG GCG GTG AAG AAC GTC GAT CCG CAT 
Leu Ser Lys Ala Val Lys Asn Val Asp Pro His 
245 250 

CCT GCC CTT TAC GGT TTC GGC GCA TAT TTG TCT 
Pro Ala Leu Tyr Gly Phe Gly Ala Tyr Leu Ser 
260 265 



901 'CTG CAG GAC 
lieu Gin Asi 
tU 27' 

949 iTTT ATC GA 
Phe He As 
285 



997 
1045 
1093 

1141 

WOO 

1189, 

123f 

1285 

1333 

1381 

1429 

1477 



GGC AAA A 

Gly Ly s A 

300 

CAG GGC ( 
Gin Gly < 



ACG CAG 
Thr Gin 



. TAC CAG 
Tyr Glr 



TTA AT 
Leu IV 
36 

I' AAG C" 
| Lys L< 
380 

GGA G 
Gly C 



GTT 
Val 



AGC 
Ser 



TTC 
Ph< 



TC 
Se 
4f 
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90] 



949 



997 



>45 

TA 



Sffi V« wis 



1 15 



<5 



50 

I 



rTG CAG GAC GCA CCG GAT TGG CCG AGT TTG CAA GGC AAC TAC AGC TGG 
[IS Gin Asp Ala Pro Asp Trp Pro Ser Leu Gin Gly Asn Tyr Ser Trp 
270 275 d - 

TTT ATC GAT TAC TAT CTG GAT CAG ATG AAG AAT GCT CAT ACG CAG AAC 
JJe lie Asp Tyr Tyr Leu Asp Gin Met Lys Asn Ala His Thr Gin Asn 
285 290 295 

PGC AAA AGA TTG CTC GAT GTG CTG GAC GTC CAC TGG TAT CCG GAA GCA 
Gl5 £rg ill Leu Asp Val Leu Asp Val His Trp Tyr Pro Glu Ala 
300 305 310 

CAG GGC GGA GGC CAG CGA ATC GTC TTT GGC GGG GCG GGC AAT ATC GAT 
Gin Gly Gly Gly Gin Arg He Val Phe Gly Gly Ala Gly Asn He Asp 
320 325 -« u 

ACG CAG AAG GCT CGC GTA CAA GCG CCA AGA TCG CTA TGG GAT CCG GCT 
Thr Gin Lys Ala Arg Val Gin Ala Pro Arg Ser Leu Trp Asp Pro Ala 
335 340 ""^ 

TAC CAG GAA GAC AGC TGG ATC GGC ACA TGG TTT TCA AGC TAC TTG CCC 
Tyr Gin Glu Asp Ser Trp He Gly Thr Trp Phe Ser Ser Tyr Leu Pro 
350 355 360 

TTA ATT CCG AAG CTG CAA TCT TCG ATT CAG ACG TAT TAT CCG GGT ACG 
Leu lie Pro Lys Leu Gin Ser Ser He Gin Thr Tyr Tyr Pro Gly Thr 
365 - 370 375 

AAG CTG GCG ATC ACA GAG TTC AGC TAC GGC GGA GAC AAT CAC ATT TCG 
Lys Leu Ala He Thr Glu Phe Ser Tyr Gly Gly Asp Asn His He Ser 
380 385 390 395 

GGA GGC ATA GCT ACC GCG GAC GCG CTC GGC ATT TTT GGA AAA TAT GGC 
Gly Gly He Ala Thr Ala Asp Ala Leu Gly He Phe Gly Lys Tyr Gly 
400 405 41° 

GTT TAT GCC GCG AAT TAC TGG CAG ACG GAG GAC AAT ACC GAT TAT ACC 
Val Tyr Ala Ala Asn Tyr Trp Gin Thr Glu Asp Asn Thr Asp Tyr Thr 
415 420 425 

AGC GCT GCT TAC AAG CTG TAT CGC AAC TAC GAC GGC AAT AAA TCG GGG 
Ser Ala Ala Tyr Lys Leu Tyr Arg Asn Tyr Asp Gly Asn Lys Ser Gly 
430 435 440 

TTC GGC TCG ATC AAA GTG GAC GCC GCT ACG TCC GAT ACG GAG AAC AGC 
Vhl G?5 ler lie Lys Val Asp Ala Ala Thr Ser Asp Thr Glu Asn Ser 
445 450 455 

TCG GTA TAC GCT TCG GTA ACT GAC GAG GAG AAT TCC GAA CTC CAC CTG 
Ser Val Tyr Ala Ser Val Thr Asp Glu Glu Asn Ser Glu Leu His Leu 
460 465 470 47b 
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1717 
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1861 
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ATC GTG CTG AAT AAA AAT TTC GAC GAT CCG ATC AAC GCT ACT TTC CAG 
He Val Leu Asn Lys Asn Phe Asp Asp Pro lie Asn Ala Thr Phe Gin 
480 485 . 490 

5 CTG TCT GGT GAT AAA ACC TAC ACA TCC GGG AGA GTA TGG GGC TTC GAC 
Leu Ser Gly Asp Lys Thr Tyr Thr Ser Gly Arg Val Trp Gly Phe Asp 
495 500 505 

CAA ACC GGA TCC GAC ATT ACG GAA CAA GCA GCT ATA ACG AAT ATT AAC 
10 Gin Thr Gly Ser Asp He Thr Glu Gin Ala Ala He Thr Asn He Asn 
510 515 520 

AAC AAT CAA TTC ACG TAT ACG CTT CCT CCA TTG TCG GCT TAC CAC ATT 
Asn Asn Gin Phe Thr Tyr Thr Leu Pro Pro Leu Ser Ala Tyr His lie 
15 525 530 535 

GTT CTG AAA GCG GAT AGC ACC GAA CCG GTC AAC TCC GAT CTC GTC GTG 
Val Leu Lys Ala Asp Ser Thr Glu Pro Val Asn Ser Asp Leu Val Val 
540 545 550 555 

CAG TAT AAG GAC GGT GAT CGC AAC AAT GCA ACC GAC AAT CAG ATC AAG 
Gin Tyr Lys Asp Gly Asp Arg Asn Asn Ala Thr Asp Asn Gin He Lys 
560 555 570 

CCG CAT TTC AAT ATT CAA AAT AAA GGG ACC AGC CCG GTA GAT CTG AGT 
Pro His Phe Asn lie Gin Asn Lys Gly Thr Ser Pro Val Asp Leu Ser 
575. 580 585 

TCC CTT ACC CTG CGC TAC TAT TTT ACC AAA GAC AGC TCT GCA GCG ATG 
Ser Leu Thr Leu Arg Tyr Tyr Phe Thr Lys Asp Ser Ser Al a Al a Met 
590 595 600 

AAC GGC TGG ATC GAT TGG GCG AAG CTC GGC GGC AGC AAC ATT CAG ATT 
Asn Gly Trp He Asp Trp Ala Lys Leu Gly Gly Ser Asn lie Gin He 
605 610 615 

TCG TTC- GGT AAT CAT AAT GGC GCG GAT TCG GAT ACG TAC GCG GAG CTG 
Ser Phe Gly Asn His Asn Gly Ala Asp Ser Asp Thr Tyr Ala Glu Leu 
620 625 630 635 

GGC TTC TCG TCC GGC GCA GGC TCG ATT GCG GAG GGC GGT CAA TCC GGC 
Gly Phe Ser Ser Gly Ala Gly Ser He Ala Glu Gly Gly Gin Ser Gly 
640 645 650 

45 GAA ATC CAG CTG CGC ATG TCG AAG GCG GAC TGG TCG AAC TTC AAC GAG 
Glu He Gin Leu Arg Met Ser Lys Ala Asp Trp Ser Asn Phe Asn Glu 
655 660 665 

GCG AAC GAC TAC TCG TTC GAT GGG GCG AAG ACG GCC TAT ATA GAT TGG 
50 Ala Asn Asp Tyr Ser Phe Asp Gly Ala Lys Thr Ala Tyr He Asp Trp 
670 675 680 



2149 '.SAT CGC GT 

149 Asp Arg Va 
i 685 

2197 t CG TAGAAG 
{Pro 

I 700 

2245 |CTGTCTTGA( 

[gcgattcaa 

2293 IcCGTTAGGG 
(2) INFOf 



2341 



2389 



2437 
2485 
2533 
2581 



2629 



2677 



2725 



( 
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(: 

Met Lys 
1 

Val Sei 
Ala Se 



lie S€ 



Asn V 
65 

Trp G 
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GAT CGC GTG ACG CTA TAC CAA GAC GGA CAA CTC GTA TGG GGA ATC GAG 

Asp Arg Val Thr Leu Tyr Gin Asp Gly Gin Leu Val Trp Gly lie Glu 

H 685 690 695 

CCG TAGAAGATGA CTAGACAACA TTAGTGATGA GACGCGGCCG GCCATAACGG 

Pro 
700 

CTGTCTTGAC TCTGATTCGA TCAAAAAATC AAAGCAAAGG GGATGAAAGT AATGAATGTT 
GCGATTCAAA AGAGAATCGG ATCAATATTG ATGATTGCCT CACTAATTAT TAGCTTATTG 
CCGTTAGGGA GCAGCAGAAC GTATGCTGCA G 



2773 

2826 

2886 
2946 
2977 



(2) INFORMATION FOR SEQ ID N0:2: 



I 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Lys Thr Arg Gin Arg Lys Arg Leu Phe Val Ser Ala Ala Leu Ala 

Val Ser Leu Thr Met Thr Val Pro Met Pro Ala Ser Val Asn Ala Ala 
20 25 30 

Ala Ser Asp Val Thr Phe Thr He Asn Thr Gin Ser Glu Arg Ala Ala 
35 40 45 

He Ser Pro Asn He Tyr Gly Thr Asn Gin Asp Leu Ser Gly Thr Glu 
50 55 60 

Asn Trp Ser Ser Arg Arg Leu Gly Gly Asn Arg Leu Thr Gly Tyr Asn 
65 70 75 

Trp Glu Asn Asn Ala Ser Ser Ala Gly Arg Asp Trp Leu His Tyr Ser 

85 90 95 

Asp Asp Phe Leu Cys Gly Asn Gly Gly Val Pro Asp Thr Asp Cys Asp 
100 .105 110 

Lys Pro Gly Ala Val Val Thr Ala Phe His Asp Lys Ser Leu Glu Asn 
U5 120 125 

Glv Ala Tyr Ser He Val Thr Leu Gin Met Ala Gly Tyr Val Ser Arg 
130 135 140 
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Asp Lys Asn Gly Pro Val Asp Glu Ser Glu Thr Ala Pro Ser Pro Arq 
145 150 155 160 

5 Trp Asp Lys Val Glu Phe Ala Lys Asn Ala Pro Phe Ser Leu Gin Pro 

165 170 175 

Asp Leu Asn Asp Gly Gin Val Tyr Met Asp Glu Glu Val Asn Phe Leu 
180 185 190 

Val Asn Arg Tyr Gly Asn Ala Ser Thr Ser Thr Gly He Lys Ala Tyr 
195 200 205 

Ser Leu Asp Asn Glu Pro Ala Leu Trp Ser Glu Thr His Pro Arg He 
210 215 220 

His Pro Glu Gin Leu Gin Ala Ala Glu Leu Val Ala Lys Ser He Asp 
225 230 235 240 

20 Leu Ser Lys Ala Val Lys Asn Val Asp Pro His Ala Glu He Phe Gly 

245 250 255 



10 



15 



Pro Ala Leu Tyr Gly Phe Gly Ala Tyr Leu Ser Leu Gin Asp Ala Pro 
260 265 270 

Asp Trp Pro Ser Leu Gin Gly Asn Tyr Ser Trp Phe He Asp Tyr Tyr 
275 . 280 285 

Leu Asp Gin Met Lys Asn Ala His Thr Gin Asn Gly Lys Arg Leu Leu 
290 295 300 

Asp Val Leu Asp Val His Trp Tyr Pro Glu Ala Gin Gly Gly Gly Gin 
305 310 315 320 

35 Arg He Val Phe Gly Gly Ala Gly Asn He Asp Thr Gin Lys Ala Arg 

325 330 335 



25 



30 



40 



Val Gin Ala Pro Arg Ser Leu Trp Asp Pro Ala Tyr Gin Glu Asp Ser 
340 345 350 

Trp He Gly Thr Trp Phe Ser Ser Tyr Leu Pro Leu He Pro Lys Leu 
355 360 365 

Gin Ser Ser He Gin Thr Tyr Tyr Pro Gly Thr Lys Leu Ala He Thr 
370 375 380 

Glu Phe Ser Tyr Gly Gly Asp Asn His He Ser Gly Gly He Ala Thr 
385 390 395 400 

50 Ala Asp Ala Leu Gly He Phe Gly Lys Tyr Gly Val Tyr Ala Ala Asn 

405 410 415 



45 



| Tyr Trp G" 



Leu Tyr A 
4 

Val Asp I 
450 

Val Thr 
465 



Asn Phe 
Thr Tyr 
lie Thr 



•■J 



Tyr Thi 
531 

Ser Th 
545 

Asp Ay 
Gin A 
Tyr T 
Trp I 



) Asn 
625 

Ala 



Met 
Phe 



■■Mi 
■■mi 
■mi 
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Tyr Trp Gin Thr 61 u Asp Asn Thr Asp Tyr Thr Ser Ala Ala Tyr Lys 



420 



425 



Leu Tyr Arg Asn Tyr Asp Gly Asn Lys Ser Gly Phe Gly Ser He Lys 



435 



440 



1 



Val Asp Ala Ala Thr Ser Asp Thr Glu Asn Ser Ser Val Tyr Ala Ser 
450 455 460 

Val Thr Asp Glu Glu Asn Ser Glu Leu His Leu He Val Leu Asn Lys 



475 



465 470 

Asn Phe Asp Asp Pro He Asn Ala Thr Phe Gin Leu Ser Gly Asp Lys 



485 



490 



■ Thr Tyr Thr Ser Gly Arg Val Trp Gly Phe Asp Gin Thr Gly Ser Asp 
iss j >-„~ 505 diu 



500 



lie Thr Glu Gin Ala Ala He Thr Asn He Asn Asn Asn Gin Phe Thr 

515 " 520 5Z5 

Tvr Thr Leu Pro Pro Leu Ser Ala Tyr His He Val Leu Lys Ala Asp 

530 535 540 

Ser Thr Glu Pro Val Asn Ser Asp Leu Val Val Gin Tyr Lys Asp Gly 

If 545 550 555 bbU 

Asp Arg Asn Asn Ala Thr Asp Asn Gin lie Lys Pro His Phe Asn lie 



565 



570 



Gin Asn Lys Gly Thr Ser Pro Val Asp Leu Ser Ser Leu Thr Leu Arg 
580 585 59° 

Tyr Tyr Phe Thr Lys Asp Ser- Ser Ala Ala Met Asn Gly Trp He Asp 



595 



600 



Trp Ala Lys Leu Gly Gly Ser Asn He Gin He Ser Phe Gly Asn His 
610 615 620 

| Asn Gly Ala Asp Ser Asp Thr Tyr Ala Glu Leu Gly Phe Ser Ser Gly 



i. 625 



630 



Ala Gly Ser He Ala Glu Gly Gly Gin Ser Gly Glu He Gin Leu Arg 



645 



650 



45 



I Met Ser Lys Ala Asp Trp Ser Asn Phe Asn Glu Ala Asn Asp Tyr Ser 
| 660 665 0/u 

| Phe Asp Gly Ala Lys Thr Ala Tyr He Asp Trp Asp Arg Val Thr Leu 
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45 



Tyr Gin Asp Gly Gin Leu Val Trp Gly He Glu Pro 
690 695 700 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 1 

(A) LENGTH: 2323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN:- NCIMB 40250 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 172.. 1869 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
GGGGTTTTGT TCATTTCAAG AAAGAGCACG AATAAACATC 
TTCTCTAATT. ATTATGGTGC ATGCCAATTG GAAAATGTAG 
GCTTTCATGC CAAGATATGT AATTTTTAAA AAAGAGTAAA 



50 



AAA CGT AGA AGC AGT AAA GTT ATT CTT TCG TTG 
Lys Arg Arg Ser Ser Lys Val He Leu Ser Leu 
5 10 

TTA TTG GCA GCC GTC GAA CCT AAT GCC GCT TTA 
Leu Leu Ala Ala Val Glu Pro Asn Ala Ala Leu 
20 25 

AGT GCC ATG CAG TCC TAT GTT GAA GCG ATG CAG 
Ser Ala Met Gin Ser Tyr Val Glu Ala Met Gin 
35 40 45 

GGC AAT TCT CTG GAT GCT GTC GGT GCG GAT GAG 
Gly Asn Ser Leu Asp Ala Val Gly Ala Asp Glu 
55 60 

AAT CCG CGG ATC ACG AAA GAG CTC ATT CAG AAC 
Asn Pro Arg lie Thr Lys Glu Leu He Gin Asn 
70 75 



GCTAATTTAC ACATGATTTA 

TAGATTAGTC ATCGTAACCT 

GGAGAATTCA G ATG AAA 
Met Lys- 
1 

GCC ATC GTT GTT GCA 
Ala He Val Val Ala 
15 

GCA GCG GCT CCA CCA 
Ala Ala Ala Pro Pro 
30 

CCT GGC TGG AAC CTT 
Pro Gly Trp Asn Leu 
50 

ACG CTG GCA CGG GGC 
Thr Leu Ala Arg Gly 
65 

ATC GCT GCG CAA GGC 
He Ala Ala Gin Gly 
80 



1AT AAG AG< 
for Lys Se 

! G CC CCA AA 
ffcla Pro As 
100 

!gtc GTA C, 
fval Val G 
1115 

CAT CAT G 
iHis His / 



CAA GTA 
Gin Val 



<60 
A 120 

177: 

.-c.A nf3 



225 



273 



321 



369 



417 



TTC AAG 
Phe Lys 



CGC TTT 
Arg Phe 
18C 

CTG GAl 
Leu Asi 
195 

GGC C/> 
Gly Gl 



CCC A' 
Pro T 



AAC G 
Asn / 



TTT 
Phe 



AAT 
Asn 
275 
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k S 55 S - S - ?S S iS E SK ?R R - 

' 85 9° 9b 

rrr rra AAT TAT CAA ATT GAA GCT GCG TAC CTC AAT CGA GTG CAG GAG 
la Pro ™ T$r IT, He GlS Ala Ala Tyr Leu Asn Arg Val Gin Glu 
100 1° 5 110 

GTA CAG TGG GCT TTG GAC GCG AAC CTC TAT GTG ATG ATT AAT GTC 
Sal Gl5 Trp Ala Leu Asp Ala Asn Leu Tyr Val Met He Asn Val 
120 125 

pat r at TCC TGG CTA TGG ATC AGC AAA ATG GAG TCG CAG CAC GAT 
SE'SS Ser Trp Lei !rp He Ser Lys Met Glu Ser Gin His Asp 
135 - - 140 iHO 

GTA CTG GCC CGT TAT AAT GCG ATT TGG ACQ CAA ATT GCC AAC AAG 
Val Leu Ala Arg Tyr Asn Ala He Trp Thr Gin He Ala Asn Lys 
150 155 



GTC 
Val 
115 

CAT 
His 



CAA 
Gin 



TTC 
Phe 



| CGC 
I Arg 



I 



AAG AAC AGC CCG AGC AAG CTG ATG TTC GAG AGC GTG AAT GAG CCT 
{5s A^n Ser Pro Ser , Lys Leu Met Phe Glu Ser Val Asn Glu Pro 
165 170 175 

TTT ACG GAT GGC GGA ACT ACG GAT GAA GCC AAG CAG CAA AAA ATG 
Phi Thr Asp Gly Gly Thr Thr Asp Glu Ala Lys Gin Gin Lys Met 
180 185 190 

CTG GAC GAG CTG AAC GTA TCC TTT TTC AAC ATC GTC AGA AAT TCC GGC 
Leu Asp Glu Leu Asn Val Ser Phe Phe Asn lie Val Arg Asn Ser Gly 
195 200 205 

GGC CAG AAC GCG ACT CGC CCG CTA GTT CTT TCT ACG TTG GAG GCC TCT 
Gly Gin Asn Ala Thr Arg Pro Leu Val Leu Ser Thr Leu Glu Ala ser 
215 "0 " 3 

CCC ACC CAA GAG AGA ATG ACG GCG CTT TAT AAT ACG ATG ACC AAA CTG 
Pro Thr Gin Glu Arg Met Thr Ala Leu Tyr Asn Thr Met Thr Lys Leu 
230 235 

AAC GAC AAG AAT CTG ATC GCA ACC GTT CAT TTT TAT GGA TTC TGG CCG 
Asn Asp Lys Asn Leu He Ala Thr Val His Phe Tyr Gly Phe Trp Pro 
24 5 

TTT AGC GTA AAT ATC GCA GGA TAT ACG AAA TTT GAT GCG GAG ACG CAA 
Phe Ser Val Asn lie Ala Gly Tyr Thr Lys Phe Asp Ala Glu Thr bin 
260 265 . 270 

AAT GAT ATT ATA ACG ACC TTC GAT AAC GTG TAT AAC ACA TTT GTA GCA 
Asn Asp lie lie Thr Thr Phe Asp Asn Val Tyr Asn Thr Phe Val A.a 
275 280 285 
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AAG GGA ATC CCG GTG GTA GTC GGC GAA TAT GGC CTT CTT GGA TTC GAT 
Lys Gly He Pro Val Val Val Gly Glu Tyr Gly Leu Leu Gly Phe Asp 
295 300 305 

AAG AAT ACC GGC GTC ATT GAA CAG GGT GAG AAA TTG AAA TTT TTC GAG 
Lys Asn Thr Gly Val He Glu Gin Gly Glu Lys Leu Lys Phe Phe Glu 
310 315 320 

TTT TTT GCC CAG TAT GTG AAG CAA AAA AGC ATT TCC ACT ATG CTA TGG 
Phe Phe Ala Gin Tyr Val Lys Gin Lys Ser He Ser Thr Met Leu Trp 
325 330 335 

GAT AAC GGA CAG CAC TTC AAC CGC ACG AGC TTC AAG TGG TCT GAC CCG 
Asp Asn Gly Gin His Phe Asn Arg Thr Ser Phe Lys Trp Ser Asp Pro 

340 ... - 345 350 

GAT TTA TTC AAT ATG ATC AAG GCC AGT TGG ACC GGA CGT TCA TCC ACG 
Asp Leu Phe Asn Met He Lys Ala Ser Trp Thr Gly Arg Ser Ser Thr 
355 360 365 370 

GCT TCC AGC GAC CTG ATC CAT GTC AAG CAG GGC ACG GCG GTA AAA GAT 
Ala Ser Ser Asp Leu He His Val Lys Gin Gly Thr Ala Val Lys Asp 
375 380 385 

ACT TCG GTT CAG CTC AAT CTT AAC GGG AAT ACG CTA ACT TCC CTT TCC 
Thr Ser Val Gin Leu Asn Leu Asn Gly Asn Thr Leu Thr Ser Leu Ser 
390 395 400 

GTA AAT GGA ACG ACA CTG AAA TCA GGC ACA GAT TAC ACT TTA AAC AGC 
Val Asn Gly Thr Thr Leu Lys Ser Gly Thr Asp Tyr Thr Leu Asn Ser 
405 410 415 

AGC AGA TTA ACT TTT AAA GCG AGC CAG TTG ACC AAG CTG ACC TCC TTG 
Ser Arg Leu Thr Phe Lys Ala Ser Gin Leu Thr Lys Leu Thr Ser Leu 
420 425 430 

GGC AAA "TTG GGG GTC AAC GCG ACG ATC GTG ACT AAA TTC AAT AGA GGC 
Gly Lys Leu Gly Val Asn Ala Thr He Val Thr Lys Phe Asn Arg Gly 
435 440 445 450 

GCC GAC TGG AAG TTC AAC GTA GTC CTG TAC AAT ACG CCT AAG CTT AGC 
Ala Asp Trp Lys Phe Asn Val Val Leu Tyr Asn Thr Pro Lys Leu Ser 
455 460 465 

AGT ACG ACG GGG ACT ACT TCT TCC TTT GCG ATT CCA ACG GCT TTC AAC 
Ser Thr Thr Gly Thr Thr Ser Ser Phe Ala He Pro Thr Ala Phe Asn 
470 475 480 

GGG GAT CAG CTT GCT ACG ATG GAA GCG GTC TAT GTA AAC GGC GGC AAT 
Gly Asp Gin Leu Ala Thr Met Glu Ala Val Tyr Val Asn Gly Gly Asn 
485 490 495 



1089 
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1185 



1233 



1281 



1329 



1377 



: 1425 
1473 
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rrr GGT CCG CAT AAC TGG ACT TCC TTT AAG GAA TTC GAA ACG ACG TTC 
AU G?J Pro His Asn Trp Thr Ser Phe Lys Glu Phe Glu Thr Thr Phe 
500 505 510 

AGC CCC GCT TAT AGC GAG GGG AAA ATC AAA CTG CAG* CAG GCG TTC TTT 
Ser Pro Ala Tyr Ser Glu Gly Lys He Lys Leu Gin Gin Ala Phe Phe 
515 520 525 i3D 

AAT GAA GTG AAT GAT ACC ACA GTC ACG CTC AAG TTC CM TTC TGG AGC 
Asn Glu Val Asn Asp Thr Thr Val Thr Leu Lys Phe Gin Phe Trp Ser 
535 540 M:> 

GGG GAG ATC GTC AAC TAC ACG ATT AAA AAG AGC GGT TCG ACG GTG ACG, 
Gly Glu He Val Asn Tyr Thr He Lys Lys Ser Gly Ser Thr Val Thr 

~ 550 - " 555 560 

GGT ACG GCT TCA TAAGCGAGTT TGGCAAAAAA GGACCGATAT ACTGCCTAAT 
Gly Thr Ala Ser 
565 

TTGGTATTGC CTTAGTTGAA AGCAATTGCT CCGAATAAAC AGAATGAAGC CCCGGCCAGC 
TGGCCGGGAC TTATGCGTTT AGGAAGTATA AACGAATCAT CAGCAATTTA TTTAGCTCGT 
CTCAGTTCAG CAATATCGGC TTCATGTGAA ACGGAGCGGA TGAACAATCT TTCGAGCAAT 
TTCTCATGCT CCTGCTGGGT TTGGAGAACG GTTTGCTGAT TAGTTTTAAG TACAGATATA 
TCCTCACGGA CTTGATTGAT TCATGTGGTC CGTTAGTTCT TCTACCTTTG TATTTGTGGC 
AG C AAC G ATA TGAATTAATT GTTGAATGTG CCCGCCATGA CTGTTTAGCT GCTCATTGTG 
GCTTTGTAAC TGTTCTCGGA TTTCTTTGAA TTCTTGGTCG TGCTCATTAA GCTT 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Lys Lys Arg Arg Ser Ser Lys Val lie Leu Ser Leu Ala lie Val 
1 5 10 13 

Val Ala Leu Leu Ala Ala Val Glu Pro Asn Ala Ala Leu Ala Ala Ala 
20 25 30 
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Pro Pro Ser Ala Met Gin Ser Tyr Val Glu Ala Met Gin Pro Gly Trp 
35 40 .45 

5 Asn Leu Gly Asn Ser Leu Asp Ala Val Gly Ala Asp Glu Thr Leu Ala 
50 55 60 

Arg Gly Asn Pro Arg He Thr Lys Glu Leu He Gin Asn lie Ala Ala 
65 70 75 80 

10 

Gin Gly Tyr Lys Ser He Arg lie Pro Val Thr Trp Asp Ser His He 
85 90 . 95 

Gly Ala Ala Pro Asn Tyr Gin lie Glu Ala Ala Tyr Leu Asn Arg Val 
15 100 105 110 

Gin Glu Val Val Gin Trp Ala Leu Asp Ala Asn Leu Tyr Val Met He 
115 120 125 

2 0 Asn Val His His Asp Ser Trp Leu Trp He Ser Lys Met Glu Ser Gin 
130 135 140 



His Asp Gin Val Leu Ala Arg Tyr Asn Ala He Trp Thr Gin lie Ala 
145 150 155 160 

Asn Lys Phe Lys Asn Ser Pro Ser Lys Leu Met Phe Glu Ser Val Asn 
165 170 175 

Glu Pro Arg Phe Thr Asp Gly Gly Thr Thr Asp Glu Ala Lys Gin Gin 
180 . " 185 190 

Lys Met Leu Asp Glu Leu Asn Val Ser Phe Phe Asn lie Val Arg Asn 
195 200 205 

35 Ser Gly Gly Gin Asn Ala Thr Arg Pro Leu Val Leu Ser Thr Leu Glu 
210 215 220 



40 



Ala Ser Pro Thr Gin Glu Arg Met Thr Ala Leu Tyr Asn Thr Met Thr 
225 . 230 235 240 

Lys Leu Asn Asp Lys Asn Leu He Ala Thr Val His Phe Tyr Gly Phe 
245 250 255 

Trp Pro Phe Ser Val Asn He Ala Gly Tyr Thr Lys Phe Asp Ala Glu 
260 265 270 

Thr Gin Asn Asp He He Thr Thr Phe Asp Asn Val Tyr Asn Thr Phe 
275 280 285 

50 Val Ala Lys Gly He Pro Val Val Val Gly Glu Tyr Gly Leu Leu Gly 
290 295 300 



45 



phe Asp Lys 
*305 

'phe Glu Ph< 



Leu Trp As 



Asp Pro A: 

Ser Thr A 
370 

Lys Asp 1 
385 

Leu Ser 1 



Asn Ser 
M Ser Leu 



Arg Gly 
450 

Leu Set 
465 

Phe As' 



Gly As 
Thr PI 



Phe P 
5 

Trp S 
545 

Val : 
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Phe Asp Lys Asn Thr Gly Val lie Glu Gin Gly Glu Lys Leu Lys Phe 
*305 "* in 



Phe Glu Phe Phe Ala Gin Tyr Val Lys Gin Lys Ser He Ser Thr Met 



325 



330 



Leu Trp Asp Asn Gly Gin His Phe Asn Arg Thr Ser Phe Lys Trp Ser 



340 



345 



Asp Pro Asp Leu Phe Asn Met lie Lys Ala Ser Trp Thr Gly Arg Ser 



355 



360 



Ser 



Thr Ala Ser Ser Asp Leu He His Val Lys Gin Gly Thr Ala Val 



370 



375 



380 



Lys Asp Thr Ser Val Gin Leu- Asn Leu Asn Gly Asn Thr Leu Thr Ser 
385 390 39i> 

Leu Ser Val Asn Gly Thr Thr Leu Lys Ser Gly Thr Asp Tyr Thr Leu 



405 



Asn Ser Ser Arg Leu Thr Phe Lys Ala Ser Gin Leu Thr Lys Leu Thr 



0.. 



7 r\Y! 



420 



425 



Ser Leu Gly Lys Leu Gly Val Asn Ala Thr He Val Thr Lys Phe Asn 



435 



440 



Arg Gly Ala Asp Trp Lys Phe Asn Val Val Leu Tyr Asn Thr Pro Lys 

450 455 4 

Leu Ser Ser Thr Thr Gly Thr Thr Ser Ser Phe Ala He Pro Thr Ala 



465 



470 



475 



50 



Phe Asn Gly Asp Gin Leu Ala Thr Met Glu Ala Val Tyr Val Asn Gly 
485 490 

Gly Asn Ala Gly Pro His Asn Trp Thr Ser Phe Lys Glu Phe Glu Thr 
500 505 

Thr Phe Ser Pro Ala Tyr Ser Glu Gly Lys He Lys Leu Gin Gin Ala 
515 520 525 

Phe Phe Asn Glu Val Asn Asp Thr Thr Val Thr Leu Lys Phe Gin Phe 



530 



535 



540 



Trp Ser Gly Glu He Val Asn Tyr Thr He Lys Lys Ser Gly Ser Thr 



545 



550 



Val Thr Gly Thr Ala Ser 
565 
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(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(v) FRAGMENT TYPE: N-terminal 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCIMB 40250 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 30. . (1625.1775) 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /partial 

/evidence- EXPERIMENTAL 
/transl_except= (pos: 1446 



1458, aa: OTR) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 
TTGAAGCGCT GAATTCAGGA GGTTAAATAA TGCGTATTCA TGCAATTCGG CAATCTTGCC 
GTTTGGTATT GACGATGGTT TTGATGCTTG GCTTATTGCT GCCTGTGGGC GCCCCCAAAG 
GCTATGCCGC TCCGGCTGTT CCTTTTGGCC AATTAAAAGT TCAGGGCAAT CAATTGGTAG 
GACAGTCCGG GCAAGCTGTT CAACTGGTTG GCATGAGCTC ACATGGATTG CAGTGGTATG 
GCAATTTCGT CAACAAATCG TCGTTGCAGT GGATGAGAGA TAACTGGGGC ATCAACGTCT 
TCCGTGCCGC TATGTATACT TCCGAAGATG GTTACATTAC GGATCCTTCC GTCAAGAACA 
AGGTGAAGGA GGCGGTTCAG GCATCCATCG ATCTGGCCTT GTACGTTATT ATTGACTGGC 
ATATCTTGTC TGATGGGAAT CCGAATACGT ACAAAGCGCA ATCGAAAGCG TTCTTCCAAG 
AGATGGCCAC ATTGTACGGC AACACGCCGA ATGTAATCTA TGAAATCGCG ACGAGCCCAA 
CGGAATGTGT CCTGGGCAGA TGTCAGTCGT CGGAAGAAGT GATCACGGCC ATTCGTTCGA 
TTGACCCCGA CGGAGTGGTT ATCGTTGGCA GTCCAACCTG GAGCCAGGAT ATTCATCTGG 
CGGCCGATAA CCCGGTATCA CATAGCAATG TCATGTATGC GCTTCATTTC TATTCAGGCA • 
CGCATGGACA GTTTTTGAGA GACCGAATTA CCTATGCGAT GAACAAAGGA GCAGCGATCT 



J -mc 

-. 60 L 
f 120 

240. 

300 

360 : 

420 

480 

540 

600 

660 

720 

780 



[TCGTTACCGA 
' CCAAAGAGTG 
; CTGATAAAG* 
CGATGCCCA 
GGCGGCAGG 
ACGCCCAGG 
\ GAGCAACGi 
ACACGAAC. 
GCGCGGGC 
GTACGGGG 
TGAAGCC" 
AGCTTNN! 
CGCAAAT 
ATACTTP 
GGACNNl 
NNNNNNI 
NNNNNN 

(2) W 
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TCGTTACCGA GTGGGGCACC AGTGATGCAT CCGGGAACGG CGGGCCGTAT TTGCCTCAGT 
CCAAAGAGTG GATCGATTTC TTGAATGCTC GCAAGATCAG CTGGGTGAAC TGGTCGCTCG 
CTGATAAAGT AGAAACGTCT GCTGCTCTTA TGCCAGGTGC ATCGCCTACC GGCGCTGGAC 
CGATGCCCAA TTGTCGAATG GGCAAATCGG GTTCGCGATC AAATCCGGCA AGCAACTGGA 
I GGCGGCAGGG CAATCCAACT GCACCGGCTG CCCCTACTAA CCTCTCGGCA AACGGCGGCA 
[ ACGCCCAGGT ATCATTAACC TGGAACGCAG TTAGGGGGGC GACGAGCTAT ACCGTAAAGC 
1 GAGCAACGAC GAGCGGCGGT CCGTACACGA ATGTGGACCG GGGTGTCACG GCGACGAGCT 
ACACGAACAC CGGGCTGACG AATGGCACGA CGTATTATTA TGTCGTGAGG GCATCCAATA. 
GCGCGGGCAG CAGCGCGAAC TCCGCGCAAG CGAGCGCAAC GCCGGCTAGC GGCGGCGCCA 
I GTACGGGGAA CCTTGTTGTC CAATACAAAG TTGGCGACAC TAGCGCCACG GATAACCAAA 
"J ; TGAAGCCTTC CTTTAACATC AAGAACAACG GTACAACCCC TGTTAACCTG AGCGGCCTCA 
j I AGCTTNNNNN NNNNNNNNAA AAAGACGGAC CTGCGGATAT GAGCTGCTCG ATCGACTGGG 1500 
| I GGCAAATCGG CCGAACGAAT GTTCTGCTGG CATTCGCTAA CTTTACCGGG AGTAATACGG 1560 
I I ATACTTACTG TTGTGAGCTA AGCTTTAGCT GCACTGCAGG TTCGTATCCC GGCTATGCGT 
1 I GGACNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
if NNNNNNNNNN NNNNNNNNNN" NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
Hf NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN 
(2) INFORMATION FOR SEQ ID N0:6: 



8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1609 base pairs 
p ) (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: N-terminal 
(vi) ORIGINAL SOURCE: ■ 

(A) ORGANISM: Bacillus lautus 

(B) STRAIN: NCIMB 40250 



I 



5. 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



1620 
1680 
1740 
1775 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 30.. 1607 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

TTGAAGCGCT GAATTCAGGA GGTTAAATA ATG CGT ATT CAT GCA ATT CGG CAA 

Met Arg He His Ala He Arg Gin 
1 5 

TCT TGC CGT TTG GTA TTG ACG ATG GTT TTG ATG CTT GGC TTA TTG CTG 
Ser Cys Arg Leu Val Leu Thr Met Val Leu Met Leu Gly Leu Leu Leu 
10 15 20 

CCT GTG GGC GCC CGG AAA GGC TAT GCC GCT CCG GCT GTT CCT TTT GGC 
Pro Val Gly Ala Pro Lys Gly Tyr Ala Ala Pro Ala Val Pro Phe Gly 
25 30 35 40 

CAA TTA AAA GTT CAG GGC AAT CAA TTG GTA GGA CAG TCC GGG CAA GCT 
Gin Leu Lys Val Gin Gly Asn Gin Leu Val Gly Gin Ser Gly Gin Ala 
45 50 55 

GTT CAA CTG GTT GGC ATG AGC TCA CAT GGA TTG CAG TGG TAT GGC AAT 
Val Gin Leu Val Gly Met Ser Ser His Gly Leu Gin Trp Tyr Gly Asn 
60 65 70 

TTC GTC AAC AAA TCG TCG TTG CAG TGG ATG AGA GAT AAC TGG GGC ATC 
Phe Val Asn Lys Ser Ser Leu Gin Trp Met Arg Asp Asn Trp Gly He 
75 80 85 

AAC GTC TTC CGT GCC GCT ATG TAT ACT TCC GAA GAT GGT TAC ATT ACG 
Asn Val Phe Arg Ala Ala Met Tyr Thr Ser Glu Asp Gly Tyr He Thr 
90 95 100 

GAT CCT TCC GTC AAG AAC AAG GTG AAG GAG GCG GTT CAG GCA TCC ATC 
Asp Pro. Ser Val Lys Asn Lys Val Lys Glu Ala Val Gin Ala Ser He 
105 110 115 120 

GAT CTG GCC TTG TAC GTT ATT ATT GAC TGG CAT ATC TTG TCT GAT GGG 
Asp Leu Ala Leu Tyr Val He He Asp Trp His He Leu Ser Asp Gly 
125 130 135 

AAT CCG AAT ACG TAC AAA GCG CAA TCG AAA GCG TTC TTC CAA GAG ATG 
Asn Pro Asn Thr Tyr Lys Ala Gin Ser Lys Ala Phe Phe Gin Glu Met 
140 145 150 

GCC ACA TTG TAC GGC AAC ACG CCG AAT GTA ATC TAT GAA ATC GCG ACG 
Ala Thr Leu Tyr Gly Asn Thr Pro Asn Val He Tyr Glu He Ala Thr 
155 160 165 



53 



101 



149 



197 



245 



293, 



341. 



389 



437 



485 



533 



AGC CCA AC 
Ser Pro Tt 
170 

ATC ACG G 
lie Thr A 
185 

AGT CCA / 
Ser Pro 



TCA CAT 
Ser His 



GGA CAG 
Gly Gin 



GCG ATC 
Ala He 
25( 

GGG CC( 
Gly Pr 
265 

CGC W> 
Arg L} 



TCT G> 
Ser A 



CCC t 
Pro / 



AAC 
Asn 



CTC 
Leu 
345 

GTT 
Val 
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AGC CCA ACG GAA TGT GTC 
Ser Pro Thr Glu Cys Val 
170 

ATC ACG GCC ATT CGT TCG 
He Thr Ala lie Arg Ser 
185 190 

AGT CCA ACC TGG AGC CAG 
Ser Pro Thr Trp Ser Gin 
205 



CTG GGC AGA TGT CAG TCG TCG GAA GAA GTG 
Leu Gly Arg Cys Gin Ser Ser Glu Glu Val 
175 180 

ATT GAC CCC GAC GGA GTG GTT ATC GTT GGC 
lie Asp Pro Asp Gly Val .Val He Val Gly 
195 200 

GAT ATT CAT CTG GCG GCC GAT AAC CCG GTA 
Asp He His Leu Ala Ala Asp Asn Pro Val 
210 215 



50 



TCA CAT AGC AAT GTC ATG TAT GCG CTT CAT TTC TAT TCA GGC ACG CAT 
5? Hll Ser aX 55 Met Tyr Ala Leu His Phe Tyr Ser Gly Thr His 
220 225 23 

GGA CAG TTT TTG AGA GAC CGA -ATT ACC TAT GCG ATG AAC AAA GGA GCA 
Gly Gin Phe Leu Arg Asp Arg lie Thr Tyr Ala Met Asn Lys Gly Ala 

235 

rrr ATC TTC GTT ACC GAG TGG GGC ACC AGT GAT GCA TCC GGG AAC GGC 
AU He III vIT ?hr G?u Trp Gly Thr Ser Asp Ala Ser Gly Asn Gly 
250 2 55 260 

rrr r rfi "TAT TTG CCT CAG TCC AAA GAG TGG ATC GAT TTC TTG AAT GCT 
Gly Pro lyr III Pri Glh Ser Lys Glu Trp lie Asp Phe Leu Asn Ala 
265 270 27 

CGC AAG ATC AGC - TGG GTG AAC TGG TCG CTC GCT GAT AAA GTA GAA ACG 
Arg Lys He Ser Trp Val Asn Trp Ser Leu Ala Asp Lys Val Glu inr 
285 290 

TCT GCT GCT CTT ATG CCA GGT GCA TCG CCT ACC GGC GCT GGA CCG ATG 
Ur AU Ala Leu Me? Pro Gly Ala Ser Pro Thr Gly Ala Gly Pro Met 
300 305 310 

CCC AAT TGT CGA ATG GGC AAA TCG GGT TCG CGA TCA AAT CCG GCA AGC 
Pro Asn Cys Arg Met Gly Lys Ser Gly Ser Arg Ser Asn Pro Ala Ser 
315 320 3Zb 

AAC TGG AGG CGG CAG GGC AAT CCA ACT GCA CCG GCT GCC CCT ACT AAC 
"n Trp Arg Arg Gin Gly Asn Pro Thr Ala Pro Ala Ala Pro Thr Asn 
330 335 

CTC TCG GCA AAC GGC GGC AAC GCC CAG GTA TCA TTA ACC TGG AAC GCA 
25 ™ Ala Z Gly Gly Asn Ala Gin Val Ser Leu Thr Trp Asn Ala 
345 350 3 

GTT AGC GGG GCG ACG AGC TAT ACC GTA AAG CGA GCA ACG ACG AGC GGC 
Val Ser Gly Ala Thr Ser Tyr Thr Val Lys Arg Ala Thr Thr Ser Gly 
365 370 



581 



629 



677 



725 



773 



821 



869 



917 



965 



1013 



1061 



1109 



1157 
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GGT CCG TAC ACG AAT GTG GAC CGG GGT GTC ACG GCG ACG AGC TAC ACG 
Gly Pro Tyr Thr Asn Val Asp Arg Gly Val Thr Ala Thr Ser Tyr Thr 
380 385 390 

AAC ACC GGG CTG ACG AAT GGC ACG ACG TAT TAT TAT GTC GTG AGG GCA 
Asn Thr Gly Leu Thr Asn Gly Thr Thr Tyr Tyr Tyr Val Val Arq Ala 
395 400 405 

TCC AAT AGC GCG GGC AGC AGC GCG AAC TCC GCG CAA GCG AGC GCA ACG 
Ser Asn Ser Ala Gly Ser Ser Ala Asn Ser Ala Gin Ala Ser Ala Thr 
410 415 420 

CCG GCT AGC GGC GGC GCC AGT ACG GGG AAC CTT GTT GTC CAA TAC AAA 
Pro Ala Ser Gly Gly Ala Ser Thr Gly Asn Leu Val Val Gin Tyr Lys 
425 430 435 440 

GTT GGC GAC ACT AGC GCC ACG GAT AAC CAA ATG AAG CCT TCC TTT AAC 
Val Gly Asp Thr Ser Ala Thr Asp Asn Gin Met Lys Pro Ser Phe Asn 
445 450 455 

ATC AAG AAC AAC GGT ACA ACC CCT GTT AAC CTG AGC GGC CTC AAG CTT 
He Lys Asn Asn Gly Thr Thr Pro Val Asn Leu Ser Gly Leu Lys Leu 
460 465 470 

NNN NNN NNN NNM NAA AAA GAC GGA CCT GCG GAT ATG AGC TGC TCG ATC 
Xaa Xaa Xaa Xaa Xaa Lys Asp Gly Pro Ala Asp Met Ser Cys Ser lie 
475 480 485 

GAC TGG GCG CAA ATC GGC CGA ACG AAT GTT CTG CTG GCA TTC GCT AAC 
Asp Trp Ala Gin lie Gly Arg Thr Asn Val Leu Leu Ala Phe Ala Asn 
490 495 500 

TTT ACC GGG AGT AAT ACG GAT ACT TAC TGT TGT GAG CTA AGC TTT AGC 
Phe Thr Gly Ser Asn Thr Asp Thr Tyr Cys Cys Glu Leu Ser Phe Ser 
505 510 515 520 

TGC ACT GCA GGT TCG TAT CCC GGC TAT GCG TGG AC 
Cys Thr Ala Gly Ser Tyr Pro Gly Tyr Ala Trp 
525 



1205 



1253 



1301 



1349 



1397 



1445 



1493 * 



1541 



1589, 



1624 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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Met Arg He His Ala He Arg Gin Ser Cys Arg Leu Val Leu Thr Met 

Val Leu Met Leu Gly Leu Leu Leu Pro Val- Gly Ala Pro Lys Gly Tyr 
20 25 30 

Ala Ala Pro Ala Val Pro Phe Gly Gin Leu Lys Val iGln Gly Asn Gin 
35 40 * b 

Leu Val Gly Gin Ser Gly Gin Ala Val Gin Leu Val Gly Met Ser Ser 
50 . 55 60 

His Gly Leu Gin Trp Tyr Gly Asn Phe Val Asn Lys Ser Ser Leu Gin 
65 70 75 

Trp Met Arg Asp Asn Trp Gly He Asn Val Phe Arg Ala Ala Met Tyr 

85 ... . 90 33 

Thr Ser Glu Asp Gly Tyr He Thr Asp Pro Ser Val Lys Asn Lys Val 
100 105 11 

Lys Glu Ala Val Gin Ala Ser He Asp Leu Ala Leu Tyr Val He He 
' 115 120 125 

Asp Trp His He Leu Ser Asp. Gly Asn Pro Asn Thr Tyr Lys Ala Gin 
130 135 140 

Ser Lys Ala Phe Phe Gin Glu Met Ala Thr Leu Tyr Gly Asn Thr Pro 
145 150 155 lbU 

Asn Val He Tyr Glu He Ala Thr Ser Pro Thr Glu Cys Val Leu Gly 
165 170 1/3 

Arg Cys Gin Ser Ser Glu Glu Val He Thr Ala He Arg Ser He Asp 
180 185 190 

Pro Asp Gly Val Val He Val Gly Ser Pro Thr Trp Ser Gin Asp lie 
195 200 205 

His Leu Ala Ala Asp Asn Pro Val Ser His Ser Asn Val Met Tyr Ala 
210 215 220 

Leu His Phe Tyr Ser Gly Thr His Gly Gin Phe Leu Arg Asp Arg He 

225 230 235 

Thr Tyr Ala Met Asn Lys Gly Ala Ala He Phe Val Thr Glu Trp Gly 
245 250 

Thr Ser Asp Ala Ser Gly Asn Gly Gly Pro. Tyr Leu Pro Gin Ser Lys 
260 265 Z'° 
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Glu Trp He Asp Phe Leu Asn Ala Arg Lys lie Ser Trp Val Asn TrD 
275 280 285 

Ser Leu Ala Asp Lys Val Glu Thr Ser Ala Ala Leu Met Pro Gly Ala 
290 295 300 

Ser Pro Thr Gly Ala Gly Pro Met Pro Asn Cys Arg Met Gly Lys Ser 
305 310 315 320 

Gly Ser Arg Ser Asn Pro Ala Ser Asn Trp Arg Arg Gin Gly Asn Pro 
325 330 335 

Thr Ala Pro Ala Ala Pro Thr Asn Leu Ser Ala Asn Gly Gly Asn Ala 
340 345 350 

Gin Val Ser Leu Thr Trp Asn Ala Val Ser Gly Ala Thr Ser Tyr Thr 
355 - - 360 365 

Val Lys Arg Ala Thr Thr Ser Gly Gly Pro Tyr Thr Asn Val Asp Arg 
370 375 380 

Gly Val Thr Ala Thr Ser Tyr Thr Asn Thr Gly Leu Thr Asn Gly Thr 
385 390 395 400 

Thr Tyr Tyr Tyr Val Val Arg Ala Ser Asn Ser Ala Gly Ser Ser Ala 
405 410 415 

Asn Ser Ala Gin Ala Ser Ala Thr Pro Ala Ser Gly Gly Ala Ser Thr 
420 425 430 

Gly Asn Leu Val Val Gin Tyr Lys Val Gly Asp Thr Ser Ala Thr Asp 
435 440 445 

Asn Gin Met Lys Pro Ser Phe Asn He Lys Asn Asn Gly Thr Thr Pro 
450 455 460 

Val Asn Leu Ser Gly Leu Lys Leu Xaa Xaa Xaa Xaa Xaa Lys Asp Gly 
465 . 470 475 480 

Pro Ala Asp Met Ser Cys Ser He Asp Trp Ala Gin He Gly Arg Thr 
485 490 495 

Asn Val Leu Leu Ala Phe Ala Asn Phe Thr Gly Ser Asn Thr Asp Thr 
500 505 510 

Tyr Cys Cys Glu Leu Ser Phe Ser Cys Thr Ala Gly Ser Tyr Pro Gly 
515 520 525 

Tyr Ala Trp 
530 
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CLAIMS 
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1. An enzyme which exhibits cellulase activity, which enzyme is producib 
by a strain of Bacillus spp., NCIMB 40250, or a related Bacillus spp. strain 

5 or a derivative of said cellulase. j 

2. An enzyme according to claim 1, which exhibits an endoglucanase activitl 
of at least about 10, more preferably at least about 20, most preferably a! 
least about 25, such as about 30, CMC-endoase units (as defined herein) pel 
mg of total protein under alkaline conditions. 

3. An enzyme according to claim 1, which is active at a temperature of up tc 
about 65 "C. 
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20 



25 



30 



4. An enzyme according to any of claims 1-3, which is an endoglucanase with! 
an apparent molecular weight of 75 kD or a cleavage product ^thereof 
exhibiting endoglucanase activity. • • ! 

5. An. enzyme according to claim 4, which is an endoglucanase encoded by the 
DNA .sequence shown in the appended Sequence Listing IDJ1, or a modification 
thereof encoding a derivative of said endoglucanase. j 

v 
t 

6. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 56 kD or a cleavage product thereof 
exhibiting endoglucanase activity. 

7. An enzyme according to claim 6, which is an endoglucanase encoded by the 
DNA sequence shown in the appended Sequence Listing ID#f, or a modification 
thereof encoding a derivative of said endoglucanase. 

8. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 4 5 kD or . a cleavage product thereof 
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9. An enzyme according to claim 8, which is an endoglucanase encoded by the 
DNA sequence shown in the appended Sequence Listing ID#6, or a modification 
thereof encoding a derivative of said endogl ucanase, 

10. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 60 kD or a cleavage product thereof 
exhibiting endoglucanase activity, or which is an endoglucanase with an 
apparent molecular weight of 56 kD or a cleavage product thereof exhibiting 
endoglucanase activity. 

11. An enzyme according to any of claims 1-3, which is an endoglucanase with 
an apparent molecular weight of 92 kD or a cleavage product thereof 
exhibiting endoglucanase activity. 



ith W ^* ^ n enz y me which comprises a core region derived from an endoglucanase 
according to any of claims 1-11 combined with a cellulose-binding domain 
derived from another cellulase enzyme, or a core region derived from another 
cellulase enzyme combined with a cellulose-binding domain derived from an 
endoglucanase according to any of claims 1-11. 

13. An enzyme according to claim 12, wherein the core region is derived from 
a cellulase enzyme which does not, in nature, comprise a cellulose-binding 
domain. 

25 14. A DNA construct which comprises a DNA sequence encoding an enzyme 
exhibiting cellulase activity, which enzyme is derivable from a strain of 
Bacillus spp., NCIMB 40250, or a related Bacillus spp. strain, or a 
derivative of said cellulase. 



30 15. A DNA construct according to claim 14, wherein the DNA sequence encodes 
an endoglucanase with an apparent molecular weight of 75 kD. 

16. A DNA construct according to claim 15, wherein the DNA sequence is the 
one shown in the appended Sequence Listing ID#1, or a modification thereof 
i?5 encoding a derivative of said endoglucanase. 
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17. A DNA construct according to claim 14, wherein the DNA sequence encod 
an endoglucanase with an apparent molecular weight of 56 kD. 



30 
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28. A it 

18. A DNA construct according to claim 17, wherein the DNA sequence is thf a h ° St 
one shown in the appended Sequence Listing ID#3, or a modification thereo] conduc 
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cultur 

29. A 
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30. A 
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31. . 
more 
more 
wit* 

32. 
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encoding a derivative of said endoglucanase. 

19. A DNA construct according to claim 14, wherein the DNA sequence encode 
10 an endoglucanase with an apparent molecular weight of 45 kD. 

. - 20. A DNA construct -according to claim 19, wherein the DNA sequence is the 
one shown in the appended Sequence Listing ID#6, or a modification thereof 
encoding a derivative of said endoglucanase 

15 

21. A DNA construct according to claim 14, wherein the DNA sequence encodes 
an endoglucanase with an apparent molecular weight of 92 kD. 

22. An expression vector which carries an inserted DNA construct according 
20 to any of claims 14-21, '• &° 



23. A cell which is transformed with a DNA construct according to any of 
claims 14-21 or with an expression vector according to claim 22. 



25 24. A cell according to claim 23, which is a bacterium. 

25. A cell according to claim 24, which is a grampositive bacteri 



urn. 



26. A cell according to claim 25, wherein the grampositive bacterium is 
selected from the group consisting of Bacillus subtilis . Racillus 
licheniformis T Bacillus lentus . Bacillus brevis . Bacillus stearothertnophilus , 
Bacillus alkalophilus . Bacillus amvloliouefaciens . Bacillus coaoulans , 
Bacillus circulans or Bacillus lautus. 
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27. A cell according to claim 24, which is a gramnegative bacterium, e.g. 
Escherichia col i . 



codi 
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reoj 



28. A method of producing an enzyme according to any of claims 1-13, wherein 
a host cell according to any of claims 23-27 is cultured under conditions 
conducive to the production of the endoglucanase or a derivative thereof, and 
the endoglucanase or derivative thereof is subsequently recovered from the 
culture. 
d es l- 

£o 29. A method according to claim 28, wherein the endoglucanase is recovered 
in mature form. 

30. A cellulolytic agent capable of degrading amorphous regions of cellulose 
fibres, the agent comprising an enzyme according to any of claims 1-13. 

31. An agent according to claim 30, which comprises a combination of two or 
more cellulases recited in any of claims 1-13, or a combination of one or 
more cellulases recited in any of claims 1-13 with one or more other enzymes 
with cellulase. activity. 

32. An agent according to claim 30 or 31, which is in the form of a non- 
dusting granulate, stabilized liquid or protected enzyme. 

33. An agent according to any of claims 30-32, which exhibits an 
25 endoglucanase activity of 500-10,000 CMC-endoase units per gram of the agent. 

| 34. An agent according to any of claims 30-33, which is a detergent additive. 

35. An agent according to claim 34, which additionally comprises another 
30 enzyme such a a protease, lipase and/or amylase. 

36. A detergent composition comprising a cellulolytic agent according to any 
of claims 30-35. 
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37. A detergent composition according to claim 36, which exhibits 
endoglucanase activity of 0.3-400 CMC-endoase units per gram of detergenl 

38. A method of reducing the rate at which cellulose-containing fabril 
become harsh or of reducing the harshness of cellulose-containing fabri< 
the method comprising treating cellulose-containing fabrics with 
cellulolytic agent according to any of claims 30-35. 

39. A method according to claim 38, wherein the treatment of the fabrics wil 
the cellulolytic agent is conducted during soaking, washing or rinsing of tf 
fabrics. 



15 



40. A method of treating a coloured, cellulose-containing fabric in order t| 
provide colour clarification, the method comprising treating the cellulose] 
containing fabric with a cellulolytic agent according to any of claims 30^ 
35. 



20 



25 



41. A method according to claim 40, wherein the treatment of the fabric with] 
the cellulolytjc agent is conducted in an aqueous medium during soaking,] 
washing or rinsing of the fabric. 



42. A method according to claim 41, wherein the aqueous medium exhibits ani 
endoglucanase activity of more than about 250 CMC-endoase units per liter of; 
the aqueous medium. ' 
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